optical character recognition

Winsage
February 22, 2026
Windows 11 operates on nearly 70% of the world's desktops. It features native extraction for compressed files, allowing users to extract files directly from the context menu without third-party applications. The introduction of tabbed browsing in File Explorer helps manage files more effectively, reducing clutter. Snap assist offers layout options for window arrangement, facilitating multitasking. Users can create separate virtual desktops, each customizable with unique wallpapers, to organize different workspaces. Windows 11 includes native screen recording capabilities and an optical character recognition (OCR) feature for extracting text from images and screenshots.
AppWizard
February 17, 2026
FairScan is a free and open-source scanning application designed for Android users, allowing them to photograph, crop, and compile multipage documents into a single PDF file. The app prioritizes user privacy and simplicity, avoiding intrusive ads and questionable privacy practices common in other scanning apps. Users can download FairScan from the Google Play Store or F-Droid, and the scanning process involves capturing images of documents in a well-lit area, with the option to add additional pages. Scanned documents can be exported as a single PDF or multiple JPEG files. While FairScan lacks features like post-capture editing and optical character recognition, it effectively serves its primary purpose without unnecessary distractions.
AppWizard
January 30, 2026
In 2026, faxing remains prevalent in industries like healthcare, real estate, and law, despite advancements in technology. Modern Android fax applications have replaced bulky fax machines, allowing users to send documents quickly by snapping photos or uploading PDFs. These apps offer features such as digital signatures, cloud storage, and security measures, making them ideal for travelers and remote workers. The Municorn Android Fax App is highlighted for its HIPAA compliance and user-friendly interface. Over 30% of healthcare providers still use faxing to meet compliance requirements, with businesses sending billions of pages annually. Modern fax apps eliminate issues like jammed paper trays and busy signals, enabling users to send documents from various locations. A small clinic reported saving hours weekly by using on-site scanning instead of visiting a fax center. Seven top Android fax apps for 2026 include Municorn Fax App, Fax.Plus, iFax, eFax, Simple Fax, Genius Fax, and Tiny Fax, each with unique features catering to different user needs. Many apps now incorporate AI for scan quality improvement and offer Optical Character Recognition (OCR) for searchable text. Security is crucial, especially for compliance, and users are advised to check for HIPAA compliance when handling sensitive information.
AppWizard
November 22, 2025
The 2018 text adventure game "You Are Jeff Bezos" highlights Jeff Bezos's net worth of 6 billion at the time, which has since surpassed 0 billion, prompting discussions on wealth distribution. Jmail is a new platform that allows users to explore over 2,000 emails related to Jeffrey Epstein in a mock Gmail interface, based on documents released by the US House Oversight Committee. Co-creators Walz and Luke Igel used AI technology for optical character recognition to make the emails accessible. Each email is linked to a legitimate document for transparency. Jmail includes a crowdsourced feature for users to highlight notable messages, with some emails receiving significant attention, such as inquiries about Trump and Denmark's financial status.
Winsage
November 10, 2025
Microsoft has transformed the Snipping Tool into a multifaceted application, adding features like video creation and Optical Character Recognition (OCR) for text extraction from images. Recently, new text editing capabilities were discovered, allowing users to insert text when editing screenshots. This feature was demonstrated by a user on X, and while Microsoft has not officially commented on it, the potential applications include simple annotations and creative projects. Windows Insiders are expected to be the first to access these new tools, although a release timeline is not yet clear.
AppWizard
October 13, 2025
Security researchers have identified a 12-year-old data-stealing attack known as Pixnapping, which targets web browsers to extract sensitive information from Android devices. This attack allows a rogue Android application to access and leak information from various apps, including Google Maps, Signal, and Venmo, as well as websites like Gmail, and can capture two-factor authentication codes from Google Authenticator. The attack utilizes a hardware side channel to access screen display pixels, employing techniques inspired by earlier research on timing attacks. A collaborative team from institutions like UC Berkeley and Carnegie Mellon University developed the modern iteration of Pixnapping, which will be presented at the 32nd ACM Conference on Computer and Communications Security. The Pixnapping framework enables a malicious app to push pixels into the rendering pipeline and read them by overlaying semi-transparent Android Activities. The attack systematically measures rendering times to infer pixel colors, allowing for the recovery of data through optical character recognition. Researchers successfully demonstrated Pixnapping on Android versions 13 to 16 across devices like the Google Pixel series and Samsung Galaxy S25. The attack does not require special permissions and exploits how the Mali GPU implements data compression, resulting in data-dependent rendering times. Pixnapping leaks only 0.6 to 2.1 pixels per second, which is still sufficient to recover Google Authenticator codes. Google has issued a patch for the underlying vulnerability tracked as CVE-2025-48561, with another patch planned for December, although there has been no evidence of exploitation in the wild. Despite attempts to mitigate Pixnapping, researchers have identified a workaround and suggest limiting an attacker's ability to compute on victim pixels as an effective strategy. They also discovered methods for attackers to identify all installed apps on a device, a capability restricted since Android 11 for privacy reasons, with Google indicating that fixing this issue may not be feasible.
Winsage
September 18, 2025
Click To Do utilizes optical character recognition (OCR) technology to make screens interactive by allowing users to select text as if it were tangible. Selecting an email address offers the option to “Send email,” while highlighting a website URL provides the choice to “Open website.” When more than ten words are selected, users can access actions powered by the Phi Silica language model on Copilot+ PCs, enabling text summarization, bulleted list generation, and content rewriting. The platform also features an “Ask Copilot” option to send selected text to Microsoft’s Copilot AI chatbot and a “Draft with Copilot in Word” option for initiating Word documents with AI assistance. The functionality is available only to users of Microsoft’s Copilot for home use or Microsoft 365 Copilot for business applications.
Search