About this model
Gemini Omni Flash is the text-to-video entry point for Google's new "Omni" model family, which the company frames as a step toward creating and editing "anything from any input," starting with video. Google DeepMind describes it as combining Gemini's intelligence with its generative media systems, and the developer docs list it as a preview model built for fast, conversational video generation and editing. In sibling configurations it also accepts images and existing clips as input.
Google's Omni family sits alongside its established Veo video line, which includes Veo 3.1 Full Quality and Veo 3 Full Quality. Its headline change over prior prompt-to-video tools is stateful, conversational editing: because context persists across turns, each revision refines an existing take rather than restarting from scratch. For image- and reference-driven work, see Gemini Omni Flash and Gemini Omni Flash R2V.
Generated videos include SynthID watermarking, Google's approach for tagging AI-generated media.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies; verify critical details against the listed data sources.
Data sources: Venice API · HuggingFace · Wikipedia