visual assistance

Winsage
April 6, 2025
Microsoft introduced Copilot Vision during an event celebrating its 50th Anniversary. This feature allows users to point their camera at objects for real-time identification by AI, integrating OpenAI's GPT models for enhanced memory, search, personalization, and visual capabilities. Currently available on the Windows Desktop app, Copilot Vision can recognize open applications without continuous monitoring. It adapts its responses based on the specific application in use, such as providing contextually relevant guidance in Blender 3D and visually indicating tools in Clipchamp. More advanced features are anticipated in the future, but no specific timeline has been provided.
Winsage
October 2, 2024
Microsoft has introduced updates to its Copilot platform, including tools like Copilot Voice, Think Deeper, and Copilot Vision, which will initially be available only to select testing groups through Copilot Labs. Copilot Labs is designed for experimental features, allowing user feedback for product enhancement. Copilot Vision enables the Copilot in Microsoft Edge to visually interpret screen content and provide real-time voice assistance, with privacy measures in place. The Think Deeper feature allows for more detailed responses to complex inquiries. Access to Copilot Labs is limited to Copilot Pro users who subscribe at a monthly rate, while Google Labs offers a no-cost alternative for experimenting with AI features.
Search