video generation

AppWizard
February 18, 2025
Google is exploring the incorporation of video generation capabilities into its AI-powered digital assistant, Gemini, as indicated by references to "videogen" found in the code of Google app version 16.6.23. The term "robin" is also linked to this feature, suggesting a connection to Gemini's existing capabilities. Google currently offers video generation through its Google Vids platform, which helps users turn concepts into videos, but integrating video generation directly into Gemini could streamline that experience. The feature is not yet live, and its release timeline is uncertain.
Winsage
December 26, 2024
Copilot+ PCs are the first personal computers to run small language models (SLMs) directly on-device, allowing for quicker interactions without relying on the cloud. Microsoft has introduced the AI Dev Gallery, which offers over 25 samples to help developers integrate on-device AI features into applications on Windows 10 and 11. The gallery requires building the project in Visual Studio, with at least 20GB of free storage and a multi-core CPU; a GPU with 8GB of VRAM is recommended for heavier models but is not mandatory for lighter ones. The app has two operational modes: Sample and Models. Testing image-generation models typically involves around 5GB of downloads, while a smaller image-upscaling model under 100MB was tested successfully, completing the process in under 30 seconds with peak RAM usage of 1GB. The resulting image resolution was 9272x4900, but clarity issues were noted, especially with text. The application lacks features for previewing images at larger sizes or downloading outputs directly. A model named Detect Human Pose was able to identify body positions within images, including desktop screenshots. Running these models effectively demands substantial storage and a capable CPU, and the practicality of downloading large models for niche use cases remains questionable.
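As a rough illustration of the on-device inference pattern these samples demonstrate, the sketch below runs a small image-upscaling model locally with Python's onnxruntime; the model file path, the single-output assumption, and the tensor layout are hypothetical stand-ins, not the gallery's actual C# samples.

```python
# Illustrative sketch of on-device image upscaling with onnxruntime.
# Assumptions: a local super-resolution ONNX model (the path is hypothetical)
# that takes a float32 NCHW RGB tensor and returns a single upscaled tensor.
import numpy as np
import onnxruntime as ort
from PIL import Image

session = ort.InferenceSession("models/upscaler.onnx",
                               providers=["CPUExecutionProvider"])

img = Image.open("input.png").convert("RGB")
x = np.asarray(img, dtype=np.float32) / 255.0          # HWC, values in 0..1
x = np.transpose(x, (2, 0, 1))[np.newaxis, ...]        # reshape to NCHW

input_name = session.get_inputs()[0].name
(y,) = session.run(None, {input_name: x})              # inference runs entirely on-device

y = np.clip(np.squeeze(y, 0), 0.0, 1.0)
out = Image.fromarray((np.transpose(y, (1, 2, 0)) * 255).astype(np.uint8))
out.save("upscaled.png")
print("Output resolution:", out.size)
```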
AppWizard
November 17, 2024
A new online game titled Oasis has emerged, utilizing artificial intelligence (AI) to generate every frame, creating a unique experience similar to Minecraft. Players explore a 3D world made of square blocks, mining resources and crafting items, with the environment evolving in real-time based on their actions. Players can also upload their own images to seed personalized scenes. The game engine has been trained on millions of hours of gameplay, learning to replicate actions like moving and breaking blocks. Players may encounter peculiarities such as unexpected items appearing in their inventory and a landscape that warps when not directly observed. The developers view Oasis as an exploration of AI's potential in gaming, with aspirations for AI-generated content that adapts to user preferences in real-time. Oasis is currently available for play, though players may need to join a queue.
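To make that mechanism concrete, here is a heavily simplified, hypothetical sketch of the action-conditioned, frame-by-frame loop such a system implies; WorldModel and read_action are illustrative placeholders rather than Decart's actual code, and the short frame history hints at why areas off-screen can warp.

```python
# Conceptual sketch of an action-conditioned, frame-by-frame world model loop
# of the kind Oasis describes. WorldModel, its dummy output, and read_action()
# are hypothetical stand-ins, not the real Oasis API.
import numpy as np

class WorldModel:
    """Placeholder for a model that predicts the next frame from
    a short history of recent frames plus the player's latest action."""
    def predict_next_frame(self, frames: list, action: str) -> np.ndarray:
        return np.zeros((360, 640, 3), dtype=np.uint8)  # dummy frame

def read_action() -> str:
    return "move_forward"  # in a real client this comes from keyboard/mouse input

model = WorldModel()
frames = [np.zeros((360, 640, 3), dtype=np.uint8)]      # seed frame (or an uploaded image)

for step in range(200):                                  # ~10 seconds at 20 fps
    action = read_action()
    next_frame = model.predict_next_frame(frames[-8:], action)
    frames.append(next_frame)                            # only a short history conditions the
                                                         # next frame, so unseen areas can drift
```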
AppWizard
October 31, 2024
Decart and Etched have developed a unique version of Minecraft that features real-time content generation, allowing players to experience unexpected transformations in the game environment due to AI "hallucinations." This innovation is powered by a model called Oasis, trained on extensive Minecraft gameplay data, enabling the AI to understand game mechanics without traditional coding. The current demo has limitations such as low resolution and short play sessions, but the companies are optimistic about future enhancements through advancements in chip design. Etched is working on a new chip that aims to improve performance by tenfold, which could lead to longer gameplay, fewer hallucinations, and better resolution. This chip is designed specifically for AI applications, focusing on inference rather than training. Despite skepticism about achieving the projected performance gains, Decart and Etched envision the potential for creating real-time virtual assistants like doctors or tutors. A demo of their AI-generated Minecraft experience is available for public exploration.
AppWizard
October 7, 2024
A user named Indiegameplus showcased an AI-generated remaster of Grand Theft Auto IV on the aivideo subreddit, featuring realistic graphics created from text prompts. The video, produced with Runway's Gen-3 Alpha model, shows a character moving through a hyper-realistic urban environment. This technology highlights the potential for AI in gaming, suggesting that future game development may increasingly integrate AI-driven graphics pipelines. Nvidia has indicated that future versions of its Deep Learning Super Sampling (DLSS) technology will include neural rendering capabilities, potentially transforming the gaming landscape.
Winsage
October 4, 2024
Flux from Black Forest Labs is an AI image generation model that has gained popularity for its high-quality visuals since launch. Previously available only online because of its resource demands, Flux can now run locally thanks to Forge, which lets users download the model for offline use. Forge is a refined version of the Automatic1111 Stable Diffusion package, featuring a simplified installation process via the Pinokio launcher for Windows. The application offers a user-friendly Gradio interface for customizing image parameters and includes editing tools for in-painting and image manipulation. Users can generate images in 60 to 90 seconds, and the software performs efficiently on modest hardware, such as an Nvidia RTX 4060 graphics card with 8GB of VRAM. Flux is free to use aside from electricity costs and excels at generating realistic images, particularly in handling text and intricate details.
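For readers who prefer a scriptable route over the Forge WebUI, below is a minimal sketch of running Flux locally with Hugging Face's diffusers library; the model ID, step count, and offloading settings are assumptions aimed at modest GPUs, not a verified equivalent of the Forge setup described above.

```python
# Sketch of running Flux locally in Python via Hugging Face diffusers.
# Assumptions: the FLUX.1-schnell checkpoint and the settings below,
# chosen to fit cards with around 8GB of VRAM.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell",   # distilled variant tuned for few steps
    torch_dtype=torch.bfloat16,
)
pipe.enable_model_cpu_offload()            # offload idle components to system RAM

image = pipe(
    prompt="a storefront with a neon sign reading 'OPEN 24 HOURS', photorealistic",
    num_inference_steps=4,                 # schnell works with very few steps
    guidance_scale=0.0,
    height=768,
    width=1024,
).images[0]

image.save("flux_local.png")
```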
AppWizard
August 7, 2024
The artificial intelligence video-generation market in mainland China is projected to grow from 8 million yuan in 2021 to 9.3 billion yuan by 2026, according to research firm LeadLeo. ByteDance's Jimeng is a key player in this market, showcasing its capabilities through various applications. Recent tests revealed mixed results in video quality, with one prompt producing a distorted three-second clip of a woman walking in Tokyo, while another prompt successfully generated a coherent three-second video of woolly mammoths in a snowy meadow. The performance of platforms like Jimeng will be monitored as they face technological and regulatory challenges.