multimodal Archives

AppWizard

March 19, 2026

ChatGPT Android App Hints At Sora Video Integration

OpenAI's generative video model, Sora, is likely to be integrated into the ChatGPT Android app, as indicated by discoveries in the beta version 1.2026.076. Testers found in-app text suggesting end-to-end video generation capabilities, allowing users to convert text and images into videos with dialogue, soundtracks, and customizable styles. The language used in the app is polished and consumer-ready, indicating a transition towards user-facing integration. Previous reports have indicated OpenAI's intention to incorporate Sora's video capabilities into ChatGPT, consolidating multimodal creation within a single platform. If integrated, users could transform text prompts and images into short videos, with options for voiceovers and music, facilitating easy sharing on social media. OpenAI's demonstrations have shown Sora's ability to create intricate 1080p videos, potentially redefining ChatGPT into a mobile video studio. The integration would likely handle intensive tasks in the cloud, with possible limitations on file size and resolution for free users. The integration of Sora into ChatGPT's Android app would provide access to a large user base, enhancing the mainstream adoption of AI video creation. The competitive landscape includes rivals like Runway and Google, all developing video capabilities. The introduction of mobile video generation raises challenges such as misinformation and copyright issues, prompting OpenAI to emphasize safety measures and content provenance strategies. While the beta strings do not confirm a launch date, features typically undergo final refinements late in development. Indicators to watch for include a new “Video” option in input modes and prompts for camera roll access. If Sora is launched in ChatGPT for Android, it will mark a significant shift for the app, making video creation an integral part of the user experience.

AppWizard

March 19, 2026

Google Labs’ Stitch is a design canvas that turns your voice into an app

Google has launched an upgraded version of Stitch, a tool from Google Labs aimed at improving user interface (UI) design through a concept called “vibe design,” which allows users to create designs using simple text prompts. Stitch utilizes Google’s Gemini models to interpret both text and visual inputs, enabling real-time design adjustments. It can produce editable design files and front-end code, integrating into existing engineering workflows. Currently in the experimental phase, Stitch aims to democratize design, allowing individuals without extensive expertise to contribute to UI development. Concerns have been raised about the potential for uniformity in design due to its streamlined approach.

AppWizard

March 18, 2026

ChatGPT’s free tier gets GPT 5.4 mini model with improved coding capabilities

OpenAI has introduced the GPT 5.4 mini and nano models, making advanced AI capabilities accessible to free users of the ChatGPT platform. The GPT 5.4 mini operates more than twice as fast as its predecessor and closely matches the performance of the larger GPT 5.4 model in key evaluations. These models are designed for environments where latency is critical, excelling in coding, reasoning, multimodal understanding, and tool utilization. The GPT 5.4 mini is available in ChatGPT’s free and Go tiers, as well as in OpenAI’s API and Codex, while the nano variant is accessible exclusively through the API, both at lower costs than the original GPT 5.4 model.

AppWizard

February 26, 2026

Google details MCP-like ‘AppFunctions’ that let Gemini use Android apps

Google has introduced early-stage developer capabilities for Android aimed at connecting applications with intelligent agents and personalized assistants, specifically Google Gemini, while prioritizing privacy and security. A key feature of this initiative is AppFunctions, introduced with Android 16, which allows applications to expose specific capabilities for access by agent apps, enabling seamless task execution on devices. Developers can define app functionalities for AI assistants, facilitating various use cases such as task management, media creation, cross-app workflows, and calendar scheduling. A practical example includes the Samsung Gallery app, where users can request specific photos through Gemini, which triggers the appropriate function to retrieve them. Additionally, Google is advancing a UI automation framework for AI agents, allowing for the execution of generic tasks across applications with minimal coding. Future expansions of these capabilities are planned for Android 17, with ongoing collaboration with select app developers to enhance user experiences.

AppWizard

February 24, 2026

Circle to Search has 3 secret powers most people don’t know about

Circle to Search has reached its second anniversary, marking a significant milestone for Google. It was introduced to Android as a practical application of artificial intelligence and has evolved to include enhanced functionalities relevant in 2026. Users can access the generative AI model Nano Banana directly through Circle to Search for image creation and editing, streamlining the remixing process. The tool also features a full-screen translation capability that allows instant translation of text displayed on screens across various apps and websites, supporting multiple languages and enabling scrolling translations. Additionally, Circle to Search can scan QR codes and barcodes displayed on screens, functioning similarly to the Camera app. Its capabilities include text selection, image searching, generative AI, code scanning, song recognition, and on-screen translation, making it a versatile tool that enhances user experience. The Google Pixel 10 is highlighted as an ideal companion for Circle to Search, equipped with AI-powered tools that enhance overall user experience.

AppWizard

December 31, 2025

Google Revamps Gemini AI Android App with Dynamic Floating Input Bar

Google is refining the user experience of its Gemini app on Android devices by transitioning from a static prompt bar to a fluid, floating pill-shaped design. This update is currently available in beta versions and aims to address user feedback regarding visual clutter. The new design condenses input options into a sleek bar that expands when the keyboard is activated, hiding some tools behind a ‘+’ menu to streamline interactions. The redesign reflects Google’s commitment to making AI interactions more intuitive and aligns with broader trends in minimalism among AI interfaces. Additionally, the update could influence third-party integrations and may be part of a strategy to fully replace Google Assistant with Gemini by 2026. User reactions to the redesign are mixed, with some expressing excitement over the modern look while others are concerned about hidden features complicating usability.

AppWizard

December 24, 2025

4 Essential Android Smartwatch Apps You Should Always Install First

Google Gemini's Wear OS app is a successor to Google Assistant, offering voice control, AI-driven reasoning, and automation for tasks like checking the weather, sending messages, and managing smart home devices. It enhances fitness app management, allowing users to control workouts without touching the device. Gemini integrates with Google services for real-time health inquiries and improves email productivity by summarizing and drafting responses. Google Keep is a note-taking app that allows text and voice input for capturing ideas and creating lists on an Android smartwatch. It features quick access through watch tiles and complications but lacks full editing capabilities. Strava is a fitness tracking app that allows users to monitor activities directly from their smartwatch, using built-in GPS to log routes and metrics. It syncs data to the mobile app after workouts, though social features are limited on the watch. SleepisolBio is a sleep management app that tracks heart rate and sleep patterns, offering therapy options and personalized recommendations. It operates on a freemium model, providing many features for free, but requires smartphone setup for effective use on a smartwatch.

AppWizard

December 9, 2025

Connected, Creative, Expanded: Android XR’s Next Wave of Innovation Enhances the Galaxy XR Experience

Samsung Electronics unveiled the Galaxy XR in October, which operates on the new Android XR platform co-created with Google and Qualcomm. The device features multimodal AI for seamless interactions through voice, vision, and gesture, and offers full immersion capabilities. The latest update to the Android XR platform includes three significant features: PC Connect, Likeness, and Travel Mode. PC Connect allows users to integrate applications or their entire computer desktop into the immersive view of the Galaxy XR, enhancing productivity and entertainment. Likeness enables users to customize their appearance during video calls on platforms like Google Meet, enhancing the sense of presence. Travel Mode allows users to create personal cinema or workspace experiences while on the go, with a stable view and a portable Travel Case. These features will roll out to users starting December 8 in the United States and Korea.

AppWizard

December 8, 2025

I saw the future of Android XR smart glasses, and Google left me stunned at the progress

Last week, a demonstration of Android XR glasses took place at Google's Hudson River office, showcasing features such as visual assistance and gyroscopic navigation. These glasses are part of a developer kit for Android developers. Google aims to integrate these devices with Android phones and smartwatches by 2026. The strategy for AI glasses includes two types: one focusing on audio and camera features, and another incorporating a display for visual cues. Developer Preview 3 of the Android XR SDK is set to launch soon, supporting a wide range of existing third-party Android apps. The glasses can display navigation routes and driver information for Uber rides. Gemini, the assistant, provides contextual information immediately upon wearing the glasses. The Samsung Galaxy XR headset has new features like PC Connect and travel mode, while Xreal's Project Aura glasses offer a 70-degree field of view and access to Android apps. The anticipated price for Project Aura could be around ,000, with a potential late next year launch.

AppWizard

December 8, 2025

Gemini revamps its web interface with fresh look and new ‘My Stuff’ folder

Google has redesigned its Gemini web interface, featuring a cleaner look and an updated dark theme. The new "My Stuff" folder simplifies access to previous content, while UI improvements coincide with the launch of Gemini's latest AI models, including Gemini 3 Pro and Nano Banana Pro. The homepage now greets users with "Hi," and includes a prompt bar with suggestions, along with a spinning animation of the Gemini logo. The left menu has been adjusted to include "My Stuff," allowing users to store images, videos, and Canvas creations separately from chat interactions. The chat interface has enhancements such as a dropdown menu for sharing, pinning, renaming, and deleting conversations.