Google is reportedly working on “Gemini Omni,” a new AI-powered system that’s all about multimodal creation.
Based on the latest leaks and online reports, this platform could make its first big appearance around Google I/O 2026.
Gemini Omni isn’t just another tool, it seems designed to roll together video generation, conversational editing, remixing options, and support for AI avatars, so users can create and edit right where they chat.

These leaks suggest that Google is experimenting with an all-in-one creative platform inside Gemini. Gemini Omni appears to be less of a traditional standalone video model and more of an AI agent, one that manages entire media workflows from start to finish. It apparently relies on Google’s power tools, like Veo, Imagen, and Gemini itself. The goal is to let users blend AI-powered video generation with conversational editing and custom avatars.

What is Gemini Omni?

Gemini Omni looks like Google’s next move for a multimodal AI creation platform, one built into the Gemini ecosystem. It seems designed to go further than current AI video generators, which mostly just give out clips from prompts. With Omni, the goal appears to be a complete, AI-powered production workflow.

Some leaked interface details include lines like “Create with Gemini Omni,” “edit directly in chat,” and “remix your videos”. That means the system may work through natural conversation, not traditional editing programs. It’s also said to offer templates, AI avatars, and identity systems you can reuse.

The “Omni” branding points to a platform that doesn’t stick with just one content type, but supports text, images, audio, and video at once. That matches Google’s larger plan which is turning Gemini into a unified AI operating layer instead of a simple chatbot.

GOOGLE I/O 🔥: New evidence of the upcoming Gemini Omni vide model has been spotted on the Gemini mobile app.

A video sample below 👀

> "Meet our new video model. Remix your videos, edit directly in chat, try a template, and more."

> Based on the description, we might be… pic.twitter.com/FOAgYRuxAV
— 🚨 AI News | TestingCatalog (@testingcatalog) May 11, 2026

What does Gemini Omni do?

Although many details remain unofficial, leaks and reports provide a clearer picture of Gemini Omni’s expected capabilities.

Key reported features include:

AI video generation straight from text prompts
Conversational video editing inside Gemini chat
Tools for video remixing and changing style
Template-based content creation for repeatable formats
Support for AI avatars and “Likeness” features
Characters that look and act the same across different projects
Tools for creating with text, images, and video together

One of the most talked about features is conversational editing. Instead of managing classic timelines, users might just tell Gemini Omni what to change, like changing the lighting, swapping the background, or shifting the camera angle. This could make sophisticated video editing a lot more accessible for creators and businesses.

Another big feature reportedly in the works is avatar integration. Separate leaks tied to Google’s “Likeness” project suggest you’ll be able to scan yourself with a phone to create an AI-driven avatar. You could then drop this avatar into videos, presentations, and any digital media project making consistent and personal branding much easier.

This platform appears focused on creators and marketers. References to templates and remix tools sound perfect for users working on YouTube Shorts, TikTok, ads, and social campaigns and not just AI enthusiasts experimenting with new tech.

But, Gemini Omni may also face several challenges.

Here’s what’s appealing:

AI makes video production easier and faster
You get smooth editing via natural language prompts
Workflows become more automated
All creation tools are in one place, integrated with Google’s services

Potential concerns include:

Ethical issues around AI avatars and “likeness” copying
Deepfake risks and misuse
High compute costs for advanced video work
Reliance on Google’s cloud AI infrastructure
Copyright questions on content generated by the system

Some people think Gemini Omni’s next step might be as a full AI production agent. Instead of simply generating clips, it could storyboard, keep visuals consistent, manage multiple content versions, and optimize videos for different platforms on its own.

Also read: Gemini in Chrome Could Transform Everyday Android Web Browsing

Conclusion

Gemini Omni marks a potential turning point in Google’s strategy. Rather than dropping another video model, it looks like the company’s building an all-in-one multimodal space where you can generate, edit, remix, and brand your content just by talking to the AI.

Most details are still from leaks and early reports, but the rumoured feature set suggests Google wants Gemini to compete not only in AI video but also in workflow automation and smart content production. If Google unveils it at I/O 2026, Gemini Omni could become one of their biggest AI announcements yet recasting Gemini as a creative operating system, not just another chatbot.

Devanshi Kashyap

Devanshi is a curious learner who enjoys exploring new ideas and expressing creativity through art.