Microsoft unveiled Copilot Vision, a feature that can ‘understand’ what appears on the screen
Copilot Vision can analyze text and images on web pages to answer queries like “What’s the recipe for this lasagna?”. For now, the feature only works in the Microsoft Edge browser and only on certain websites.
Source
Google launched PaliGemma 2 vision-language model
The new model can generate long captions for images that describe actions, emotions, and narratives of the scene. It also has “accurate optical character recognition and understanding of the structure and content of tables in documents”.
Source
OpenAI launched the o1 model with image uploads and analysis
The o1 represents a significant evolution in reasoning model capabilities, including better handling of complex tasks, image-based reasoning, and enhanced accuracy. The company also confirmed there will be a new $200 monthly subscription.
Source
Google will add “Expressive Captions” on Android
The feature, which is built atop Google’s existing Live Captions, uses artificial intelligence to add “emotion to your captions” to make it more useful.
Source
New products
OnePlus Ace 5 Pro
Ace 5 Pro will have a 6.82-inch OLED screen, Snapdragon 8 Elite processor, 24 GB of RAM and 1 TB of storage. It will come with a 6,000mAh battery.
Source
Realme 14x
The phone will have 6.67-inch screen, 6GB or 8GB of RAM and up to 256GB of storage. It will come with a 6,000mAh battery.
Source


