~/Gemini Omni Video
Text to Video

Gemini Omni Video

Multimodal video generation with omni references.

Schema
Add images (0/5)

Upload up to 5 images

Hint: You can drag and drop a file or click to upload

Example

Input

Video
Input 1Input

Sample output shown above. The prompt and settings are pre-filled on the left.

Your request will cost $0.69 per run. ($0.115/s) For $10 you can run this model approximately 14 times.

One more thing:

README

Google / Gemini Omni VideoText to Video (gemini-omni-video)

Gemini Omni Video is a multimodal video generation model supporting text, image, video, audio, and character references.

Highlights

  • Multimodal Combine text, images, video clips, audio, and characters.
  • 4K support Generate up to 4K resolution videos.
  • Character consistency Reuse character IDs across generations.
  • Flexible duration 4, 6, 8, or 10 seconds.

Parameters

  • prompt*Text description of the video to generate
  • image_urlsReference images (max 5). Video uses 2 image slots out of 7 total.
  • video_urlReference video (max 1). Formats: mp4, mov. Uses 2 image slots.
  • video_startStart timestamp (seconds) for the reference video clip. Defaults to 0.
  • video_endEnd timestamp (seconds) for the reference video clip. Defaults to full video length.
  • audio_idsAudio ID list (max 1). Obtain IDs from the Kie audio library.
  • character_idsCharacter ID list (max 3). Each character uses 1 image slot. Obtain IDs from the Kie character library.
  • durationVideo duration in seconds
    • 4 seconds
    • 6 seconds
    • 8 seconds
    • 10 seconds
  • aspect_ratioVideo aspect ratio
    • 16:9 (Landscape)
    • 9:16 (Portrait)
  • resolutionVideo resolution
    • 720p
    • 1080p
    • 4K
  • seedRandom seed for reproducibility (0-2147483647)

Pricing

$0.69 per generation

How to Use

  1. 1.Write a prompt describing your scene.
  2. 2.Optionally attach image, video, or audio references.
  3. 3.Choose duration, resolution, and aspect ratio.
  4. 4.Submit and download your video.

More Models to Try

Storytelling: Multi-reference video scenes.
Marketing: Brand-consistent video content.
Creative projects: Reference-driven generation.

Frequently Asked Questions

What is the Gemini Omni Video API?
Multimodal video generation with omni references.
How much does Gemini Omni Video cost via API?
Gemini Omni Video costs $0.6900 per generation through Renderful's API. No subscription required — pay only for what you use.
How do I use Gemini Omni Video via API?
Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "gemini-omni-video". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
What type of content does Gemini Omni Video generate?
Gemini Omni Video is a text to video model by Google. Key features include: 4-10s videos, 4K support, Image refs.
Is the Gemini Omni Video API fast?
Gemini Omni Video has medium generation speed. Results are delivered via polling or webhook callback for seamless integration.