multimodal capabilities

AppWizard
February 4, 2025
AI-powered search engine Perplexity has launched a new feature called Perplexity Assistant, available for Android devices, which integrates reasoning, search capabilities, and app functionalities. The assistant can perform multi-app actions, such as hailing rides and searching for songs, and can set reminders by creating calendar entries. It utilizes the phone's camera for contextual inquiries and maintains context across various tasks, like researching restaurants and making reservations. The assistant is initially free for users in 15 languages. CEO Aravind Srinivas acknowledged that some features may not perform as expected and improvements are planned. Perplexity has also introduced Sonar, an API service for enterprises, and acquired Read.cv. Founded in 2022, Perplexity has raised over 0 million in funding and processes over 100 million queries weekly. The company faces legal challenges from publishers, including lawsuits from News Corp and a cease and desist order from The New York Times, but emphasizes its commitment to respecting publisher content through a revenue-sharing program.
AppWizard
February 3, 2025
Perplexity AI has launched a new Android app called the Perplexity Assistant, available on the Google Play Store. The app is designed to assist with various tasks through voice, text, and camera interactions, and it can converse in 15 languages. It utilizes Perplexity’s proprietary search engine to provide real-time web information and maintain context across multiple tasks. Users can perform activities such as booking rides, identifying objects, and making restaurant reservations through voice commands. The app is free and aims to integrate Perplexity’s AI into users' daily workflows. Perplexity has also introduced an API called Sonar for businesses and acquired the professional social media platform Read.cv.
AppWizard
December 5, 2024
Google is developing an 'AI Mode' for its Search app on Android, allowing users to interact in a conversational manner. This feature, discovered in an APK teardown, will enable voice inputs and the submission of photos and videos, creating a more intuitive search experience. The AI Mode will be accessible through a dedicated tab represented by a 'wink' icon in the app. This development follows Google's earlier integration of generative AI into Search, including the Search Generative Experience and AI Overviews, which have been refined based on user feedback.
Winsage
November 20, 2024
Microsoft has introduced new services and products to enhance its AI agent portfolio at the Ignite 2024 conference, including significant upgrades to Copilot Studio with improved knowledge sources and tuning capabilities. The autonomous agents in Copilot Studio, currently in public preview, now feature multimodal capabilities for voice and image analysis. Updated security measures have been implemented, including encryption and data loss prevention, to ensure data protection. Microsoft plans to roll out autonomous capabilities in Copilot Studio by November. A Capgemini survey indicates that over 80% of executives intend to integrate AI agents within the next three years, with Toyota Motor Corporation already using generative AI agents. Gartner's Avivah Litan warned that by 2028, one in four enterprise breaches may be linked to AI agent misuse. KPMG is exploring AI agents but prioritizes establishing security measures before production. The deployment of agentic AI will require increased computing capacity, prompting Microsoft to develop customized chips and an Azure Boost DPU for enhanced security and workload optimization. Additionally, the Azure Integrated Hardware Security Module has been created to improve data center security.
Search