Daily News – 2nd of March 2023
Microsoft unveiled an AI model that understands images and can solve visual puzzles The multimodal model (MLLM), called Kosmos-1, can perceive general modalities, follow instructions, learn in context, and generate outputs. It has shown promising capabilities on various tasks like visual QA, perception-language tasks and vision tasks. 𝘚𝘰𝘶𝘳𝘤𝘦 OpenAI launched Whisper API for speech-to-text transcription…