APIXO
Video

Gemini Omni

Gemini Omni brings Google's multimodal reasoning into video generation. Use APIXO to create async Gemini Omni videos from text, image references, video inputs, reusable audio assets, and character asset IDs.

720p / 1080p / 4k
4 / 6 / 8 / 10 seconds
Image, video, audio, and character references
16:9 / 9:16
Generated preview

Gemini Omni

Start now

Create with Gemini Omni

Loading workspace...

Gemini Omni Tips

Create video generation with Gemini Omni

Use Gemini Omni through APIXO to try prompts, inspect examples, compare costs, and keep useful settings ready for future generations.

Create

Start with an idea

Pick a model, tune the settings, and create in one place.

Write an idea

Describe what you want to make.

Use an example

Start from a ready-made style.

Adjust the look

Tune the key settings quickly.

Create

Generate when everything feels right.

Keep results

Save what you want to revisit.

Try again

Make the next version faster.

Gemini Omni API Technical Specs

Current Gemini Omni capabilities and important creation limits.

Gemini Omni Modes

Gemini Omni video generation with optional multimodal references

Gemini Omni Output

720p / 1080p / 4k

Gemini Omni Controls

4 / 6 / 8 / 10 seconds / Image, video, audio, and character references

Gemini Omni Latency

Typically 1-3 minutes

Google Multimodal Video API

Key Gemini Omni capabilities

Gemini Omni video generation

Use Gemini Omni for Gemini Omni video generation with optional multimodal references creation with simple controls for prompts, references, and output settings.

Gemini Omni reference control

Upload supported reference assets where available, then keep refining the same idea on the model page.

Gemini Omni generation status

Keep track of longer video generations while the result is being prepared.

What can you build with Gemini Omni?

Gemini Omni product videos

Create product reveals, ecommerce ads, launch clips, and campaign motion assets.

Gemini Omni social creative

Generate short videos for social channels and iterate quickly across prompt, duration, resolution, and ratio settings.

Gemini Omni storyboard workflows

Use text prompts and references to validate scenes, shots, and motion ideas before committing production budget.

Read FAQ

Gemini Omni notes and questions

The browser Playground is fixed to gemini-omni-video for this launch; audio and character asset creation are documented for API use.
Video generation requires prompt and resolution. Duration is required only when video_urls is not provided.
Supported video durations without source video are 4, 6, 8, and 10 seconds.
Video mode supports up to 3 audio IDs, up to 3 character IDs, and up to 1 source video URL.
The multimodal quota from image_urls, video_urls, and character_ids cannot exceed 7 units. Each image uses 1 unit, one source video uses 2 units, and each character ID uses 1 unit.
video_start and video_end require video_urls, cannot be negative, and video_end must be greater than or equal to video_start.
Actual latency varies by prompt complexity, reference inputs, selected resolution, and provider queue load.

Yes. Use the Create tab to try prompts, upload supported inputs, and adjust settings directly in the browser.

APIXO

Write a prompt, adjust the style, and generate your next image in one place.