speech-to-text

AppWizard
December 26, 2024
In 2024, Made by Google released multiple updates for the Pixel phone, introducing new features and enhancing existing functionalities across six updates. - January Feature Drop: Launched with the Pixel 8 and 8 Pro, it included the Pixel Thermometer app for forehead temperature readings, Circle to Search, and the rebranding of Quick Share from Nearby Share. - March Feature Drop: Introduced with Android 14 QPR2, it featured an expandable Bluetooth Quick Settings Tile, a Material You volume slider, new casting options, and a "Hello?" button for the Call Screen feature. The Pixel Tablet received the Gboard Voice Toolbar. - June Feature Drop: Marked by the early launch of the Pixel 8a, it introduced Audio Emoji, Display Port Support for external screens, Gemini Nano technology, and camera improvements for various Pixel models. The Android 14 QPR3 update focused on minor tweaks. - Pixel 9 Series Launch: Debuted alongside Android 14, backporting features from the Android 15 Beta. Introduced three new applications: Pixel Screenshots, Pixel Studio, and Pixel Weather, along with Gemini Live and on-device Call Notes. - Android 15/October Drop: Introduced Android 15 with a Private Space option, Predictive Back functionality, a redesigned screenshot interface, underwater photography capabilities for the Pixel 9 series, and an enhanced Adaptive Vibration feature. - December Drop: Included Android 15 QPR1 with a Material You redesign for Settings, charging optimization, enhancements to the Pixel Screenshots app, a clear voice feature for the Pixel Recorder, broader rollout of contextual replies for Gemini, and Dual Screen Portrait Mode for foldable devices. Google extended Android OS updates for the Pixel 6 and 7 series.
Winsage
December 26, 2024
Users experiencing issues with their microphone unmuting itself on Windows can try the following solutions: 1. Check the physical mute button for functionality. 2. Review the microphone app settings for auto-unmute features. 3. Disable exclusive mode by accessing sound properties and unchecking the option for applications to take exclusive control. 4. Adjust the Communications setting in sound properties to "Do nothing." 5. Test the microphone on a different PC to determine if the issue is specific to the current configuration. 6. Contact microphone support for additional assistance or firmware updates. Additionally, third-party software like Zoom or Skype can alter microphone settings, and voice-activated applications may automatically unmute the microphone to listen for commands.
Winsage
December 14, 2024
In 2024, Microsoft introduced the "Copilot+ PC" branding for AI-capable laptops, while Apple launched Apple Intelligence. These developments have led to mixed outcomes, with features like real-time translations and on-device speech-to-text being beneficial, but others, such as Windows Recall, still proving their value. By 2025, mainstream developers are expected to integrate on-device AI into Windows applications, influencing consumer purchasing decisions. The term "TOPS" (Trillions of Operations Per Second) is becoming important for evaluating the AI performance of Windows laptops, with a minimum of 40 TOPS required for Microsoft's "Copilot PC+" designation. Qualcomm's Copilot+ PCs reported around 45 TOPS, significantly higher than Intel's 11 TOPS. By the end of 2024, premium Windows laptops are expected to see a three- to four-fold increase in NPU performance compared to 2023 models. Analysts speculate further performance improvements may occur towards the end of 2025. Despite the potential for a two- to three-fold enhancement in on-device AI performance, experts caution against overemphasizing TOPS figures, which may not accurately reflect real-world performance. The lack of a unified API for leveraging NPU capabilities in Windows complicates matters for users of Copilot+ laptops without Qualcomm chips. Although AMD and Intel have released competitive chips, Qualcomm currently holds an advantage with exclusive support for certain applications. Microsoft is promoting its low-level machine learning API (DirectML) and the Windows Copilot Runtime, which may enhance the Copilot+ PC ecosystem. While cloud-based AI solutions remain an option, the cost of these services is expected to rise, making on-device AI more appealing. The introduction of ChatGPT Pro highlights the financial implications of cloud access compared to on-device NPU usage, which incurs no additional costs. The pace of on-device AI adoption in Windows' software ecosystem is anticipated to accelerate in 2025.
Winsage
December 12, 2024
Microsoft has updated Windows 11 to version 24H2 with the KB5048667 update, removing a compatibility block for USB scanners using the eSCL scan protocol. This change allows affected systems to install the update more smoothly. The update also modifies the date format on the taskbar, includes critical security fixes, and introduces several new features such as: - Rebranding of "Tailored Experiences" to "Personalized offers." - A shortened date and time format in the system tray, with an option to revert to the traditional format. - Access to jump lists for pinned apps in the Start menu. - Customizable touchscreen edge gestures. - Hiding of the IME toolbar in full-screen mode for Chinese or Japanese typing. - Sharing content directly to Android devices from File Explorer. - New placeholder messages in Dynamic Lighting Settings when no compatible device is connected. - Enhancements to speech-to-text and text-to-speech functionalities. - New functions in Narrator scan mode for improved accessibility. The update can be accessed via Windows Update or downloaded from the Microsoft Update Catalog.
Winsage
December 11, 2024
Microsoft has released a new preview build for Windows Insiders, designated as 27764, in the Canary Channel. Key enhancements include: - Start Menu: Users can access jump lists by right-clicking pinned apps. - Dynamic Lighting: A placeholder message appears when no compatible devices are connected, and new directional options have been added to the Wave and Gradient effects. - Input: The IME toolbar will be hidden in full-screen mode for Chinese or Japanese typing. - Narrator: New shortcuts allow users to skip links and jump to lists in scan mode. - Speech in Windows: Improvements to speech-to-text and text-to-speech functionalities, with prompts for manual language file updates. Resolved issues include: - A bugcheck issue causing PAGEFAULTINNONPAGEDAREA errors. - File Explorer improvements to prevent hanging with numerous media files. - Fixes for widget text overlapping on secondary monitors. - Resolution of mouse cursor visibility issues when pointer trails are enabled. - Corrections for HDDs being misidentified as SSDs in Task Manager. - Addressing lag and screen tearing on secondary monitors. - Fix for Excel hanging when opening certain files. Known bugs include: - Issues with Windows Hello PIN and biometrics for users transitioning to the Canary Channel. - Rollbacks during installation of Canary builds. - Ongoing work to resolve issues with accent-colored window borders, window shadows, and animations.
Winsage
December 10, 2024
Microsoft has released two cumulative updates for Windows 11: KB5048667 for version 24H2 and KB5048685 for version 23H2. These updates are mandatory and include December 2024 Patch Tuesday security enhancements. The build number for Windows 11 24H2 will change to 26100.2605, and for 23H2 to 226x1.4602. Key changes include the removal of the bell icon from the taskbar, a shortened date and time display, and support for jump lists in the Start menu. New features include: - A section for touchscreen edge gestures in Settings. - The IME toolbar will hide in full-screen mode for Chinese or Japanese typing. - Users can share content to an Android device via File Explorer if Phone Link is configured. - New directional options for the Wave effect in Dynamic Lighting Settings. - Administrative privileges for jump list items when holding Shift and CTRL. - Enhancements to speech-to-text and text-to-speech functionalities. - Fixes for excessive spacing in File Explorer, search box truncation, app window clustering, Mica rendering issues, lag on secondary displays, pointer location circles, clipboard history accuracy, and new functions in Narrator scan mode. The updates are being rolled out gradually.
AppWizard
December 4, 2024
Google is enhancing its Recorder app with a new "clear voice" feature to minimize background noise and improve voice clarity. The app has also introduced a "summarize" function on the Pixel 8, allowing users to obtain concise summaries of audio recordings. The Recorder app debuted on Chromebooks in November, offering functionalities like speech-to-text generation and title suggestions. It utilizes Google's advanced large language model (LLM) for improved user experience. Pixel devices now have a "Transcribe again" tool for re-transcribing recordings in 42 languages, with cloud processing for quick transcription results.
Search