Video to Video

Kling Lipsync

Add lip-synced speech to any video.

Kling / Kling Lipsync: Video to Video (kling-lipsync)

Kling Lipsync adds synchronized speech to videos using text-to-speech synthesis.

Highlights

  • Accurate lip sync – Realistic mouth movements matching synthesized speech.
  • Voice selection – Multiple voice options for different styles.
  • TTS synthesis – Automatic speech generation from text.

Parameters

  • video* – Input video to add lip sync to (MP4/MOV, ≤100MB, 2-10s, 720p/1080p)
  • text* – The text to synthesize into speech (max 120 characters)
  • voice_id* – Voice to use for speech synthesis
    • Klee (Genshin)
    • Vindi (Genshin)
    • Kirara (Genshin)
    • Kaiya
    • Shatang
    • Male Voice 1
    • Student Voice
    • AOT Voice
  • voice_language – Language for speech synthesis
    • English
    • Chinese
  • voice_speed – Speech rate (0.8 to 2.0)
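
The limits in the parameter list can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: the function name `build_lipsync_input`, the returned dict layout, and the use of the display names above as `voice_id` values are all assumptions made for illustration. Only the limits themselves (120 characters, the 0.8 to 2.0 speed range, the two languages) come from the list above.

```python
# Limits copied from the parameter list; everything else here is assumed.
MAX_TEXT_CHARS = 120
SPEED_RANGE = (0.8, 2.0)
LANGUAGES = {"English", "Chinese"}
# Display names from the list. The real API may expect different identifiers.
VOICES = {
    "Klee (Genshin)", "Vindi (Genshin)", "Kirara (Genshin)", "Kaiya",
    "Shatang", "Male Voice 1", "Student Voice", "AOT Voice",
}


def build_lipsync_input(video, text, voice_id, voice_language=None, voice_speed=None):
    """Validate lip-sync parameters and return them as a dict.

    The dict layout is a hypothetical shape, not a confirmed API format.
    """
    if not video:
        raise ValueError("video is required")
    if not text or len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text must be 1-{MAX_TEXT_CHARS} characters")
    if voice_id not in VOICES:
        raise ValueError(f"unknown voice_id: {voice_id!r}")

    params = {"video": video, "text": text, "voice_id": voice_id}

    # Optional parameters are only included when the caller sets them.
    if voice_language is not None:
        if voice_language not in LANGUAGES:
            raise ValueError(f"unsupported voice_language: {voice_language!r}")
        params["voice_language"] = voice_language
    if voice_speed is not None:
        low, high = SPEED_RANGE
        if not low <= voice_speed <= high:
            raise ValueError(f"voice_speed must be between {low} and {high}")
        params["voice_speed"] = voice_speed
    return params
```

Calling `build_lipsync_input("clip.mp4", "Hello there", "Kaiya", voice_speed=1.0)` returns a dict with the four supplied values, while a 121-character `text` raises `ValueError`.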

Pricing

$0.50 per generation
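
For budgeting, the flat per-generation price makes the arithmetic simple. The sketch below assumes the price stays at $0.50 and that every generation is billed the same; the helper names are hypothetical.

```python
from decimal import Decimal

PRICE_PER_GENERATION = Decimal("0.50")  # from the Pricing section


def generations_for_budget(budget_usd):
    """Return how many whole generations a budget (in USD) covers."""
    return int(Decimal(str(budget_usd)) // PRICE_PER_GENERATION)


def cost_of(generations):
    """Return the total cost in USD for a number of generations."""
    return PRICE_PER_GENERATION * generations
```

Under these assumptions, a $10 budget covers 20 generations.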

How to Use

  1. Upload a video with a visible face (2-10s, 720p/1080p).
  2. Enter the text you want the character to speak.
  3. Select a voice style.
  4. Download your lip-synced video.

Use Cases

  • Dubbing: Add new dialogue to existing videos.
  • Translations: Create localized video content.
  • Presentations: Make videos with speaking avatars.

Frequently Asked Questions

What is the Kling Lipsync API?
The Kling Lipsync API adds lip-synced speech to a video, generating the audio with text-to-speech synthesis.
How much does Kling Lipsync cost via API?
Kling Lipsync costs $0.50 per generation through Renderful's API. No subscription is required; you pay only for what you use.
How do I use Kling Lipsync via API?
Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "kling-lipsync". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
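
As a rough illustration of that request, here is a sketch that builds the POST without sending it. Only the `/v1/predictions` path and the model name `"kling-lipsync"` come from the answer above. The base URL (passed in by the caller), the Bearer-token header, and the `"input"` key are assumptions; check renderful.ai/docs for the actual format.

```python
import json
import urllib.request


def build_prediction_request(base_url, api_key, model_input):
    """Build (but do not send) a POST request for the /v1/predictions endpoint.

    Assumed, not documented here: Bearer-token auth, a JSON body, and an
    "input" key holding the model parameters.
    """
    body = {"model": "kling-lipsync", "input": model_input}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/predictions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The returned object can be passed to `urllib.request.urlopen` once the base URL and body format have been confirmed against the documentation.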
What type of content does Kling Lipsync generate?
Kling Lipsync is a video-to-video model by Kling. Key features include 2-10s videos, lip sync, and TTS synthesis.
Is the Kling Lipsync API fast?
Kling Lipsync has medium generation speed. Results are delivered via polling or webhook callback for seamless integration.
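
The polling option can be sketched as a simple loop. This is an assumption-heavy outline: `fetch_prediction` is a caller-supplied function (for example, one that issues a GET for the prediction), and the status names `"succeeded"` and `"failed"` are placeholders, since the source does not specify the response format.

```python
import time


def wait_for_result(fetch_prediction, interval_s=2.0, timeout_s=300.0, sleep=time.sleep):
    """Poll until the prediction reaches a terminal status or time runs out.

    fetch_prediction must return a dict with a "status" key. The terminal
    status names used here are assumed, not documented.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        prediction = fetch_prediction()
        if prediction.get("status") in ("succeeded", "failed"):
            return prediction
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not finish in time")
        sleep(interval_s)
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays. A webhook callback avoids polling entirely, at the cost of exposing a publicly reachable endpoint.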