Gemini Omni Video is a multimodal video generation model that supports text, image, video, audio, and character references for rich, controllable output.

Highlights

•Multimodal – Combine text, images, video clips, audio, and characters.
•4K support – Generate videos at up to 4K resolution.
•Character consistency – Reuse character IDs across generations.
•Flexible duration – 4, 6, 8, or 10 seconds.

Parameters

•prompt* – Text description of the video to generate
•image_urls – Reference images (max 5). All references share 7 image slots in total; a reference video uses 2 of them.
•video_url – Reference video (max 1). Formats: mp4, mov. Uses 2 image slots.
•video_start – Start timestamp (seconds) for the reference video clip. Defaults to 0.
•video_end – End timestamp (seconds) for the reference video clip. Defaults to full video length.
•audio_ids – Audio ID list (max 1). Obtain IDs from the Kie audio library.
•character_ids – Character ID list (max 3). Each character uses 1 image slot. Obtain IDs from the Kie character library.
•duration – Video duration in seconds: 4, 6, 8, or 10
•aspect_ratio – Video aspect ratio: 16:9 (Landscape) or 9:16 (Portrait)
•resolution – Video resolution: 720p, 1080p, or 4K
•seed – Random seed for reproducibility (0-2147483647)
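
The limits above can be checked before a request is submitted. The following is a minimal sketch, not part of any official SDK: the function names are hypothetical, and it assumes that each reference image uses one slot (the list only states the cost of a video and of a character).

```python
# Hypothetical client-side checks for the parameters listed above.
ALLOWED_DURATIONS = {4, 6, 8, 10}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16"}
ALLOWED_RESOLUTIONS = {"720p", "1080p", "4K"}
MAX_SLOTS = 7
MAX_SEED = 2147483647


def slots_used(image_urls=(), video_url=None, character_ids=()):
    """Count image slots: 1 per image (assumed), 2 per video, 1 per character."""
    return len(image_urls) + (2 if video_url else 0) + len(character_ids)


def validate_input(params):
    """Return a list of error messages; an empty list means the input looks valid."""
    errors = []
    if not params.get("prompt"):
        errors.append("prompt is required")

    images = params.get("image_urls") or []
    characters = params.get("character_ids") or []
    audio = params.get("audio_ids") or []
    if len(images) > 5:
        errors.append("image_urls accepts at most 5 images")
    if len(characters) > 3:
        errors.append("character_ids accepts at most 3 characters")
    if len(audio) > 1:
        errors.append("audio_ids accepts at most 1 audio ID")

    used = slots_used(images, params.get("video_url"), characters)
    if used > MAX_SLOTS:
        errors.append(
            f"references use {used} image slots, but only {MAX_SLOTS} are available"
        )

    if "duration" in params and params["duration"] not in ALLOWED_DURATIONS:
        errors.append("duration must be 4, 6, 8, or 10")
    if "aspect_ratio" in params and params["aspect_ratio"] not in ALLOWED_ASPECT_RATIOS:
        errors.append("aspect_ratio must be 16:9 or 9:16")
    if "resolution" in params and params["resolution"] not in ALLOWED_RESOLUTIONS:
        errors.append("resolution must be 720p, 1080p, or 4K")

    start = params.get("video_start")
    end = params.get("video_end")
    if start is not None and start < 0:
        errors.append("video_start must not be negative")
    if start is not None and end is not None and end <= start:
        errors.append("video_end must be greater than video_start")

    seed = params.get("seed")
    if seed is not None and not (isinstance(seed, int) and 0 <= seed <= MAX_SEED):
        errors.append("seed must be an integer between 0 and 2147483647")
    return errors
```

For example, 2 images, 1 video, and 3 characters use exactly 7 slots under this counting, while adding a third image would exceed the budget.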

Pricing

$0.69 per generation

How to use

1.Write a prompt describing your scene.
2.Optionally attach image, video, or audio references.
3.Choose duration, resolution, and aspect ratio.
4.Submit and download your video.

Other models to try

•Storytelling: Multi-reference video scenes.

•Marketing: Brand-consistent video content.

•Creative projects: Reference-driven generation.

Frequently Asked Questions

What is the Gemini Omni Video API?

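Gemini Omni Video is a multimodal video generation model, available through Renderful's API, that accepts text, image, video, audio, and character references.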

How much does Gemini Omni Video cost via API?

Gemini Omni Video costs $0.69 per generation through Renderful's API. No subscription is required; you pay only for what you use.

How do I use Gemini Omni Video via API?

Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "gemini-omni-video". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
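
As a rough illustration of that request, the sketch below builds the POST call with Python's standard library. Only the /v1/predictions path and the model name come from this page; the base URL, the Bearer authorization header, and the nesting of parameters under an "input" key are assumptions, so check renderful.ai/docs for the actual request shape.

```python
import json
import urllib.request

# Assumption: the API host is not stated on this page; replace with the documented one.
API_BASE = "https://api.renderful.ai"


def build_prediction_request(api_key, prompt, **options):
    """Build (but do not send) a POST request for a Gemini Omni Video generation."""
    body = {
        "model": "gemini-omni-video",
        # Assumption: parameters are nested under "input".
        "input": {"prompt": prompt, **options},
    }
    return urllib.request.Request(
        url=f"{API_BASE}/v1/predictions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: Bearer token authentication.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_prediction_request(
    "YOUR_API_KEY",
    "A paper boat drifting down a rainy street",
    duration=8,
    aspect_ratio="16:9",
    resolution="1080p",
)
# To send it: with urllib.request.urlopen(request) as resp: result = json.load(resp)
```

Building the request separately from sending it makes the payload easy to inspect or log before any credits are spent.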

What type of content does Gemini Omni Video generate?

Gemini Omni Video is a text-to-video model by Google. It generates 4 to 10 second videos at up to 4K resolution and supports image references.

Is the Gemini Omni Video API fast?

Gemini Omni Video has medium generation speed. Results are delivered via polling or webhook callback for seamless integration.
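
The polling option can be sketched as a small loop that checks the job until it finishes. This is an illustrative helper only: the `fetch_status` callable, the "status" field, and the "succeeded" and "failed" values are hypothetical placeholders, not the documented response format.

```python
import time


def wait_for_result(fetch_status, interval=5.0, timeout=600.0, sleep=time.sleep):
    """Call fetch_status() repeatedly until the job succeeds, fails, or times out.

    fetch_status is any callable returning a dict with a "status" key
    (hypothetical shape; adapt it to the real API response).
    """
    deadline = time.monotonic() + timeout
    while True:
        prediction = fetch_status()
        status = prediction.get("status")
        if status == "succeeded":
            return prediction
        if status == "failed":
            raise RuntimeError(f"generation failed: {prediction.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("generation did not finish before the timeout")
        # Wait before the next check to avoid hammering the API.
        sleep(interval)
```

In practice `fetch_status` would wrap a GET request for the prediction. A webhook callback avoids the loop entirely and is usually the better fit for server-side integrations.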

Google / Gemini Omni Video — Text to Video (gemini-omni-video)