speech-to-text

Winsage
November 20, 2025
Microsoft has introduced a range of artificial intelligence features in Windows 11, marking a departure from Windows 10. The Calendar flyout feature, which was absent since October 2021, will return, allowing users to access it by clicking the date and time stamp in the bottom right corner of the screen. The new Copilot chatbot will integrate AI capabilities into various text boxes across the operating system, utilizing the neural processing unit (NPU) of modern PCs to enhance efficiency. The taskbar will include the "Ask Copilot" function and a new Researcher app for facilitating research. Users can opt out of these taskbar apps if desired. The "Fluid dictation" tool will convert speech to text, while Microsoft 365 applications will use AI for email summaries and automatic alt-text for images. An "Agent Mode" will enable users to create documents and spreadsheets based on simple prompts. At the Ignite 2025 conference, Microsoft emphasized its vision of Windows 11 as an "agentic" operating system capable of executing complex tasks autonomously, although this raises concerns about data security.
Winsage
July 12, 2025
In preview build 27898 of Windows 11, Microsoft introduces features such as the automatic shrinking of Taskbar items when there are many pinned applications, a revamped pop-up system for application permissions, and the ability to add custom words to the speech-to-text dictionary. The upcoming Windows 11 25H2 update is expected to launch in the coming months and will share a servicing branch with the 24H2 version, featuring a unique deployment strategy where necessary code is installed but remains inactive until the official update is applied.
AppWizard
May 5, 2025
The Gemini app has introduced a new Android homescreen widget that is now widely available to users. The widget is highly resizable, incorporates Dynamic Color, and resembles the revamped widgets of Google Keep and Drive. It features a sparkle icon that launches the app and keyboard, and includes buttons arranged within a rounded rectangle, with a Gemini Live shortcut. Users can expand the widget to access additional functionalities like speech-to-text and the Gemini camera, and it can be configured in various layouts, including a search bar and a grid of shortcuts. The widget is part of Gemini app version 1.0.751104895 and is compatible with Android 10 and newer versions. Users can add it to their homescreen by updating the app and selecting Widgets.
Winsage
April 25, 2025
Microsoft has launched the AI Dev Gallery, an open-source application for Windows developers aimed at integrating AI functionalities into projects. Initially introduced as a concept in December 2024, it was officially showcased on April 22. The platform provides resources such as sample applications, model downloads, and exportable source code, and is available for download in preview format from the Microsoft Store. Key features include the ability to experiment with AI applications offline and a variety of interactive samples, including Retrieval-Augmented Generation, chat interfaces, object detection, text-to-speech/speech-to-text conversion, and document summarization and analysis, all designed to run locally on developers' machines.
AppWizard
April 23, 2025
Video games are increasingly incorporating accessibility options to cater to a diverse audience. A new accessibility-support questionnaire has been introduced by Valve for developers on Steam, allowing them to specify various accessibility features in their games. The questionnaire includes options such as adjustable text size, narrated menus, and colorblind options. Valve plans to display these selected accessibility features on game store pages and will allow players to filter search results based on these features. While participation in the system is not mandatory, it is encouraged to enhance accessibility. The listed accessibility options cover gameplay, audio, visual, and input categories, including adjustable difficulty, custom volume controls, adjustable text size, and various input methods.
Winsage
March 23, 2025
Microsoft's Windows 11 includes several accessibility features aimed at enhancing user experience. - Voice Access allows users to control their PCs with voice commands, operating in three modes: default, command-only, and dictation. It functions without an internet connection. - Live Captions automatically generate captions for any audio playing on the PC, aiding users in noisy environments or while multitasking. - Focus helps users minimize distractions by customizing notification settings and can be programmed to activate during specific activities. - Narrator reads text aloud from the screen, useful for managing extensive texts, with customizable settings for speed, pitch, and voice. - On-Screen Keyboard provides a virtual keyboard for input via mouse or touchscreen, customizable for usability. - Mouse pointer options allow users to adjust the size and color of the cursor and enable pointer trails for better visibility. Custom cursor designs can also be downloaded from the Microsoft Store.
AppWizard
December 26, 2024
In 2024, Made by Google released multiple updates for the Pixel phone, introducing new features and enhancing existing functionalities across six updates. - January Feature Drop: Launched with the Pixel 8 and 8 Pro, it included the Pixel Thermometer app for forehead temperature readings, Circle to Search, and the rebranding of Quick Share from Nearby Share. - March Feature Drop: Introduced with Android 14 QPR2, it featured an expandable Bluetooth Quick Settings Tile, a Material You volume slider, new casting options, and a "Hello?" button for the Call Screen feature. The Pixel Tablet received the Gboard Voice Toolbar. - June Feature Drop: Marked by the early launch of the Pixel 8a, it introduced Audio Emoji, Display Port Support for external screens, Gemini Nano technology, and camera improvements for various Pixel models. The Android 14 QPR3 update focused on minor tweaks. - Pixel 9 Series Launch: Debuted alongside Android 14, backporting features from the Android 15 Beta. Introduced three new applications: Pixel Screenshots, Pixel Studio, and Pixel Weather, along with Gemini Live and on-device Call Notes. - Android 15/October Drop: Introduced Android 15 with a Private Space option, Predictive Back functionality, a redesigned screenshot interface, underwater photography capabilities for the Pixel 9 series, and an enhanced Adaptive Vibration feature. - December Drop: Included Android 15 QPR1 with a Material You redesign for Settings, charging optimization, enhancements to the Pixel Screenshots app, a clear voice feature for the Pixel Recorder, broader rollout of contextual replies for Gemini, and Dual Screen Portrait Mode for foldable devices. Google extended Android OS updates for the Pixel 6 and 7 series.
Winsage
December 26, 2024
Users experiencing issues with their microphone unmuting itself on Windows can try the following solutions: 1. Check the physical mute button for functionality. 2. Review the microphone app settings for auto-unmute features. 3. Disable exclusive mode by accessing sound properties and unchecking the option for applications to take exclusive control. 4. Adjust the Communications setting in sound properties to "Do nothing." 5. Test the microphone on a different PC to determine if the issue is specific to the current configuration. 6. Contact microphone support for additional assistance or firmware updates. Additionally, third-party software like Zoom or Skype can alter microphone settings, and voice-activated applications may automatically unmute the microphone to listen for commands.
Winsage
December 14, 2024
In 2024, Microsoft introduced the "Copilot+ PC" branding for AI-capable laptops, while Apple launched Apple Intelligence. These developments have led to mixed outcomes, with features like real-time translations and on-device speech-to-text being beneficial, but others, such as Windows Recall, still proving their value. By 2025, mainstream developers are expected to integrate on-device AI into Windows applications, influencing consumer purchasing decisions. The term "TOPS" (Trillions of Operations Per Second) is becoming important for evaluating the AI performance of Windows laptops, with a minimum of 40 TOPS required for Microsoft's "Copilot PC+" designation. Qualcomm's Copilot+ PCs reported around 45 TOPS, significantly higher than Intel's 11 TOPS. By the end of 2024, premium Windows laptops are expected to see a three- to four-fold increase in NPU performance compared to 2023 models. Analysts speculate further performance improvements may occur towards the end of 2025. Despite the potential for a two- to three-fold enhancement in on-device AI performance, experts caution against overemphasizing TOPS figures, which may not accurately reflect real-world performance. The lack of a unified API for leveraging NPU capabilities in Windows complicates matters for users of Copilot+ laptops without Qualcomm chips. Although AMD and Intel have released competitive chips, Qualcomm currently holds an advantage with exclusive support for certain applications. Microsoft is promoting its low-level machine learning API (DirectML) and the Windows Copilot Runtime, which may enhance the Copilot+ PC ecosystem. While cloud-based AI solutions remain an option, the cost of these services is expected to rise, making on-device AI more appealing. The introduction of ChatGPT Pro highlights the financial implications of cloud access compared to on-device NPU usage, which incurs no additional costs. The pace of on-device AI adoption in Windows' software ecosystem is anticipated to accelerate in 2025.
Search