image generation

Winsage
February 20, 2025
Microsoft has announced that certain Notepad and Paint features will be locked behind a paid Microsoft 365 subscription. Both apps remain usable without a subscription, but non-subscribers will miss out on the latest features. In Notepad, spell check will remain available, but AI rewriting of text selections, AI generation of alternative versions of a selection, and AI shortening or lengthening of a selection will require a subscription. In Paint, the Image Creator feature, which uses OpenAI’s DALL-E for image generation, will be behind the paywall, while the background removal tool will remain accessible to all users. The subscription for Microsoft 365 starts at .99 per month or .99 annually.
Winsage
February 9, 2025
Windows 11 Insider Preview Build 22635.4880 (KB5052100) is being released to the Beta Channel, specifically for users who have not transitioned to updates based on Windows 11, version 24H2. This update includes general enhancements and fixes, such as addressing a crash issue on the Settings Home page. Known issues include sluggishness in closing File Explorer and the appearance of enterprise-specific cards on non-managed PCs. Additionally, an update for Paint introduces a Copilot menu with features like Cocreator and Image Creator, accessible on Copilot+ PCs. Windows Insiders in the Beta Channel will receive updates based on Windows 11, version 23H2, and features may evolve or be removed based on feedback.
Winsage
February 5, 2025
Microsoft has introduced a new Copilot button in the Paint app, accessible to all users with the release of Windows 11 Build 26120.3073 (KB5050090) in the Dev Channel of the Insider Program. This update is also available as an optional update for Windows Insiders on the Beta Channel, while those on the Canary and Dev Channels will receive it with Paint version 11.2412.271.0 and higher. The Copilot button consolidates various AI features, including Cocreator, Image Creator, Generative Erase, and Remove Background, each accompanied by a description to assist users. Cocreator generates images locally from user prompts on Copilot+ PCs, using the Neural Processing Unit (NPU). Image Creator allows users to create images on an existing canvas by entering prompts and adjusting creativity levels. Generative Erase enables users to remove unwanted portions of an image and blends the remaining elements seamlessly. The Remove Background feature automatically eliminates backgrounds from images, with effectiveness depending on contrast and resolution. Using Image Creator requires AI credits, which are included with a Microsoft 365 subscription. The update also includes improvements to Windows Search, allowing natural language searches for OneDrive files.
Winsage
February 4, 2025
Microsoft released a new Windows 11 Insider Preview build (Build 26120.3073) that includes enhancements to the Paint application and Windows Search, available to Windows Insiders in the Canary and Dev Channels. The Paint app now features a Copilot menu that groups its AI functions: Cocreator for generating AI images from sketches, Image Creator for text-to-image generation, Generative Erase for removing unwanted objects, and Remove Background. Of these, Cocreator is exclusive to Copilot+ PCs. Additionally, Windows Search has improved semantic indexing to support searching for documents, photos, and files in OneDrive, which is also limited to Copilot+ PCs.
AppWizard
February 3, 2025
The Gemini app has introduced Gemini 2.0 Flash, which enhances user interactions with rapid responses and improved performance. Users of Gemini Advanced will have a 1M token context window for uploading files up to 1,500 pages and priority access to features like Deep Research and Gems. The app's image generation capabilities have been upgraded to Imagen 3, producing images with greater detail and accuracy. The rollout of 2.0 Flash will begin across Gemini web and mobile applications, including enterprise accounts, while versions 1.5 Flash and 1.5 Pro will remain available for a few weeks.
Winsage
December 26, 2024
Copilot+ PCs are the first personal computers to run Small Language Models (SLMs) directly on-device, allowing for quicker interactions without relying on the cloud. Microsoft has introduced the AI Dev Gallery, which offers over 25 samples for developers to integrate on-device AI features into applications on Windows 10 and 11. Using the gallery requires building the project in Visual Studio, with at least 20GB of storage and a multi-core CPU. A GPU with 8GB of VRAM is recommended for heavier models but is not mandatory for lighter ones. The app has two operational modes: Sample and Models. Testing image generation models typically requires a download of around 5GB, while a smaller image upscaling model under 100MB was successfully tested, completing the process in under 30 seconds with peak RAM usage of 1GB. The resulting image resolution was 9272x4900, but clarity issues were noted, especially with text. The application lacks features for previewing images at larger sizes or downloading outputs directly. A model named Detect Human Pose was able to identify positions within images, including desktop screenshots. Substantial storage and a capable CPU are needed to accommodate the models, and the practicality of downloading large models for niche use cases is questioned.
AppWizard
December 17, 2024
Google has begun rolling out its Gemini 2.0 Flash AI model to the Android version of its chatbot, following the initial release on December 12. The new model includes a model switcher feature for users to select their preferred AI model. The Gemini 2.0 family offers enhanced capabilities, including image generation support, and the Flash variant is the smallest and fastest model available in an experimental preview. In the Google app for Android version 15.50 beta, the model information has become interactive, allowing free users to switch between the 1.5 Flash and 2.0 Flash models, while subscribers can access the 1.5 Pro model. The Gemini 2.0 Flash model is currently in early preview and may have functionality issues. Internal testing has shown that it outperforms the 1.5 Pro model in various benchmarks.
AppWizard
December 17, 2024
Google has launched an AI experiment called Whisk for Labs testers, which allows users to generate images by uploading images that define subject, scene, and style. Whisk uses Gemini and Imagen 3 to extract characteristics from the uploaded images, creating variations in attributes like height, hairstyle, and skin tone. It features a "review and edit" option for users to refine generated images and provides a detailed description of the creation process. As of December 16, U.S. Labs testers can sign up to explore Whisk. Additionally, Google has updated Imagen 3 and Veo 2. Imagen 3 now offers brighter and better-composed images with improved adherence to prompts, rolling out globally in ImageFX within Google Labs. Veo 2 has been enhanced to create high-quality videos in 4K resolution, allowing for detailed descriptions and improved understanding of expressions and movements. The updates for Veo 2 are being rolled out in Google Labs' VideoFX, with plans to integrate it into YouTube Shorts and other products in the future.
Winsage
December 13, 2024
Windows Recall is a feature introduced in the Windows 11, version 24H2 update, exclusive to Copilot+ PCs, which captures snapshots of the user's screen at regular intervals and uses on-device AI to analyze the content. Microsoft has implemented a setting to filter out sensitive information from these snapshots, aiming to prevent the capture of data from applications or websites that handle sensitive information. However, a report indicated that Windows Recall still captured sensitive financial information, including credit card numbers, despite the filtering setting being enabled. Testing showed inconsistent results, as sensitive fields were captured in some contexts but not in others. Windows Recall is currently in beta and available through Microsoft's Windows 11 preview program, with Microsoft actively seeking user feedback to improve the feature before its official launch.