Introducing Gemini Omni
Summary
Google has introduced Gemini Omni Flash, a new AI model that generates and edits video from any combination of text, image, audio, and video inputs. The model reasons about physics and real-world knowledge to produce realistic video, and users edit the results through natural-language conversation, giving text instructions instead of using traditional editing tools. Successive edits build on one another while keeping characters, physics, and scene details consistent.
Original source: https://deepmind.google/blog/introducing-gemini-omni/
First tracked: May 19, 2026 at 02:00 PM
Classified by LLM (prompt v3) · confidence: 95%