voice input

AppWizard
January 15, 2026
OpenAI has launched ChatGPT Translate, a new online translation tool that supports over 50 languages and offers real-time translations with automatic language detection. It allows users to customize the tone, fluency, and complexity of translations. The tool has a dedicated website and hints at future features like image and file translation, though these are not yet available. Currently, it lacks voice input and full website translation capabilities. There is no standalone app or integration with the existing ChatGPT application at this time.
AppWizard
December 27, 2025
Traveling in Japan can be challenging due to language barriers, as English speakers are scarce, even in major cities. The Google Translate app, particularly its voice input feature, proved essential for communication, allowing users to translate spoken Japanese instantly. Google Lens also facilitated understanding written Japanese by translating text captured through the camera. While apps are helpful, learning basic Japanese phrases is recommended for situations when technology fails. Engaging with locals in Japanese is encouraged, as it enhances the travel experience.
AppWizard
December 24, 2025
Google is updating its Gemini app for Android to reduce visual clutter and enhance user engagement. An APK teardown indicates a redesign of the app’s input box from a static sheet to a dynamic, floating pill-shaped bar in version 16.51.52 beta. This floating bar expands when users start typing, improving one-handed operation on larger devices. The update also introduces a ‘Tools’ menu that consolidates options like image upload and voice input, streamlining workflows for power users. Additionally, hints of ‘Gemini Labs’ suggest an experimental section for users to test upcoming features. The redesign aims to make AI interactions feel more natural and accessible while addressing user feedback about previous designs. Mixed reactions to Gemini's automotive rollout indicate some praise for hands-free capabilities, though interface glitches remain a concern. User privacy is emphasized, with guides available for opting out of tracking. Feedback from beta testers suggests improved multitasking capabilities. The redesign aligns with Google’s broader AI strategy, emphasizing intuitive updates and competitive pressures in the AI market. Developers have noted similarities to past Google designs, and the potential rollout timeline is speculative, with expectations for a gesture-based interaction model. Overall, these changes position Gemini as a leading AI assistant, focusing on a decluttered interface and user-centric design.
AppWizard
December 15, 2025
Google has released Android XR SDK Developer Preview 3, which enhances AI Glasses development with two new libraries: Jetpack Projected and Jetpack Compose Glimmer, and expands ARCore for Jetpack XR to include motion tracking and geospatial capabilities. Jetpack Projected allows apps to project XR experiences from host devices to AI Glasses, enabling interaction with the glasses' hardware. Jetpack Compose Glimmer provides UI components for creating augmented experiences, utilizing optical see-through technology. An AI Glasses emulator is also available in Android Studio for developers to preview UI designs. The expanded ARCore capabilities include retrieving planar data, anchoring content, motion tracking, and geospatial pose support. Developers can access these features through Android Studio Canary with the latest emulator version.
Winsage
December 3, 2025
On the International Day of Persons with Disabilities, the Windows Accessibility team emphasizes their commitment to inclusivity and accessibility, guided by the principle of “nothing about us without us.” They collaborate with advisory boards from the disability community to enhance product features. Fluid Dictation on Windows allows users to dictate text seamlessly, correcting grammar and punctuation in real time, and operates offline on Copilot+ PCs. Voice Access has been improved to accommodate diverse communication styles, with features like adjustable wait time, a custom word dictionary, flexible command recognition, enhanced speech pattern recognition, and support for Chinese and Japanese. Narrator and Magnifier now feature human-like voices developed with Azure AI, enhancing the user experience with natural conversation nuances. Recent updates to Narrator in Microsoft Word improve navigation and text drafting, with clearer announcements and concise feedback on spelling and grammar. Additional enhancements include a Screen Curtain for privacy, richer image descriptions, and tools for speech recap and live transcription. Users are encouraged to provide feedback to guide further development, and technical assistance is available through the Disability Answer Desk.
Winsage
December 3, 2025
Microsoft has released a holiday advertisement featuring its 'Hey Copilot' voice input and AI capabilities in Windows 11, set to the song A-Punk by Vampire Weekend. The ad showcases family-oriented scenarios, such as syncing Christmas lights to music and navigating assembly instructions with Copilot's assistance, including a cameo from Santa. Viewer reactions have been mixed, with some expressing skepticism about Copilot's capabilities and suggesting humorous voice commands. Concerns have been raised regarding the disconnect between the ad's portrayal of Copilot's functionality and the actual limitations of the technology, particularly its inability to connect smart home devices to a Windows 11 PC. Users appreciate advancements in voice input technology but feel that the advertisement may oversell its current capabilities, leading to potential disappointment.
AppWizard
December 3, 2025
Gemini is set to unveil a redesigned tool menu that resembles OpenAI's ChatGPT interface, featuring a sliding menu that integrates various functionalities like image and video generation. A new voice input feature will allow continuous recording by pressing and holding the microphone icon. Gemini will enhance its Maps integration with detailed place recommendations and the ability to export curated lists to Google Maps. Additionally, a new Labs icon in the Gemini Live interface suggests potential experimental features may be introduced soon.
AppWizard
November 27, 2025
Google is refining its Circle to Search feature by introducing a follow-up search bar at the bottom of the screen to enhance user interaction. This simplified interface includes a new AI Mode that allows for voice input and context-specific queries. The feature is currently in beta testing and is not yet available to all users. The design aims to make follow-up queries more seamless, allowing users to explore further information without losing sight of initial results. The feature is being tested in the latest beta version of the Google app (version 16.47), with an uncertain timeline for a wider release.
AppWizard
November 18, 2025
xAI’s Grok chatbot has launched a home screen widget for Android devices, allowing users to access core functionalities with a single tap. This feature was developed in response to feedback from the Android community. Users are encouraged to upgrade to version 1.0.75 to utilize features like Chat, Imagine, and Voice. The widget appears as an elongated bar on the home screen and includes quick-access shortcuts for image-based searches and audio input. Customization options allow users to resize the widget while keeping functionalities easily accessible. Early users have reported an issue with the "Voice" button, which sometimes redirects to the main app instead of activating voice input. The development team is aware of this issue and plans to include a fix in the next app update.
Search