conversational AI

Winsage
November 1, 2024
OpenAI launched the Advanced Voice feature in May during the GPT-4o release, achieving response times of 232 milliseconds and an average of 320 milliseconds. The rollout began in September for ChatGPT Plus and Team subscribers in the U.S. The feature expanded to users in the EU, Switzerland, Iceland, Norway, and Liechtenstein. Users can access it by downloading the latest version of the ChatGPT app. Advanced Voice is also available in the macOS and Windows desktop applications, with a daily usage limit. Recent enhancements include five new voices and features for custom instructions and conversation memory. During DevDay 2024, OpenAI introduced the Realtime API for developers, with pricing based on text and audio token usage.
AppWizard
October 3, 2024
Google is introducing Gemini, an AI that aims to enhance its capabilities beyond traditional chatbots by integrating with various Google applications like Google Keep, Google Calendar, and Tasks, in addition to its current functionality with Gmail, Maps, and YouTube. This integration will allow users to transfer information between apps seamlessly, such as retrieving recipes from Gmail and adding ingredients to Google Keep. Gemini will also utilize the phone's camera to capture images for reminders in Google Calendar. The rollout of these features is expected in the coming weeks, and Gemini has begun to take on functions previously managed by Google Assistant. Google has announced enhancements to Gmail's summary cards and "Happening soon" features, which will provide timely updates based on email content. Additionally, Gemini will support over 40 languages, starting with French, Spanish, German, Portuguese, and Hindi, and will allow conversations in two languages simultaneously.
AppWizard
September 18, 2024
Google has launched Gemini Live, a voice AI chatbot available for free to all Android users. Initially introduced during the Pixel 9 launch in August, it was first accessible only to Gemini Advanced subscribers. Now, it can be used by anyone with the Gemini app or its overlay on Android devices. OpenAI's Advanced Voice Mode, demonstrated in May, has had limited availability due to computational constraints. Gemini Live allows users to engage in natural conversations, pause discussions, and continue dialogues while their phones are locked. Google plans to integrate Gemini with various applications, including Keep, Tasks, Utilities, Calendar, and YouTube Music. The integration enables users to access the AI assistant at any time and interact with applications like Gmail and Google Messages. Google has also expanded Gemini AI's reach in India, focusing on multilingual capabilities for cross-border eCommerce. Tim Peters from Enghouse Systems noted the potential of multilingual AI chatbots for small and midsize businesses to facilitate global customer connections through real-time translations.
AppWizard
September 3, 2024
Google is working on integrating its conversational AI, Gemini Live, into Android Auto, as indicated by code strings found in the latest version, v12.8.14. Users may soon be able to start conversations with the AI by selecting an option. Gemini Live, launched for Gemini Advanced subscribers on August 13, is designed to be interactive and responsive, capable of understanding complex inquiries. Android Auto currently features AI-driven tools for road safety and convenience, such as AI-generated summaries for text messages and easy ETA messaging. The integration of Gemini Live could enhance navigation and communication for drivers, although development is still in the early stages.
Search