Speech to Text

OpenAI Whisper with Video

Name: OpenAI Whisper with Video API
Brand: Renderful
Price: 0.006 USD
Availability: InStock

Transcribe video audio with timestamps.

Schema

video*


language

timestamps: Include word-level timestamps for subtitle generation.


Your request will cost $0.0005 per run. For $10, you can run this model approximately 20,000 times.


README

OpenAI / OpenAI Whisper with Video — Speech to Text (openai-whisper-with-video)

OpenAI Whisper Large v3 Video-to-Text extracts audio from video files and provides accurate multilingual transcription with optional timestamped segments for subtitle creation.

Highlights

• Direct video input – No need to extract audio separately.
• Timestamp support – Get word-level timestamps for subtitles.
• Multilingual – Supports 99+ languages with auto-detection.
• Production ready – No cold starts, predictable pricing.

Parameters

• video* – Video file to transcribe. Supports MP4, MOV, WebM, and AVI formats.
• language – Language code (e.g., "en", "es", "fr"). Auto-detected if not specified.
• timestamps – Include word-level timestamps for subtitle generation.
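As a rough illustration of the parameters above, the sketch below assembles an input dictionary and checks the file extension against the formats listed. The helper name `build_input` and the client-side validation are assumptions made for this example, not part of Renderful's API; only the three keys mirror the parameter names in this README.

```python
from pathlib import PurePosixPath
from typing import Optional
from urllib.parse import urlparse

# Formats listed in this README; the service itself may accept others.
SUPPORTED_EXTENSIONS = {".mp4", ".mov", ".webm", ".avi"}


def build_input(video: str, language: Optional[str] = None,
                timestamps: bool = False) -> dict:
    """Assemble the model input (hypothetical helper, not an official client)."""
    # Read the extension from the path portion so URL query strings are ignored.
    suffix = PurePosixPath(urlparse(video).path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported video format: {suffix or 'none'}")
    payload = {"video": video, "timestamps": timestamps}
    # Leave the language out entirely so the service can auto-detect it.
    if language is not None:
        payload["language"] = language
    return payload
```

Omitting `language` rather than sending an empty value keeps the auto-detection path explicit in the payload.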

Pricing

$0.006 per generation

Billing unit	Price
Per minute	$0.006
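To make the per-minute rate concrete, here is a small cost estimator. It assumes the charge scales linearly with video duration; this page does not say how partial minutes are billed, so treat the proration as an assumption. The function names are illustrative only.

```python
from decimal import Decimal

PRICE_PER_MINUTE = Decimal("0.006")  # USD, taken from the pricing table


def estimate_cost(duration_seconds: int) -> Decimal:
    """Estimate the charge for one video, assuming linear per-second proration."""
    return PRICE_PER_MINUTE * Decimal(duration_seconds) / Decimal(60)


def runs_for_budget(budget_usd: str, duration_seconds: int) -> int:
    """Number of videos of the given length that fit in a budget, rounded down."""
    cost = estimate_cost(duration_seconds)
    if cost <= 0:
        raise ValueError("duration_seconds must be positive")
    return int(Decimal(budget_usd) / cost)
```

Under that assumption, `estimate_cost(5)` equals `Decimal("0.0005")` and `runs_for_budget("10", 5)` returns `20000`, which would be consistent with the per-run estimate quoted on this page if the sample clip is about five seconds long.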

How to use

1. Upload a video file (MP4, MOV, WebM, or AVI).
2. Enable timestamps for subtitle-ready output.
3. Optionally specify the language code.
4. Receive transcription with optional timestamps.

Tips

• Enable timestamps for subtitle generation workflows.
• Works best with clear dialogue and minimal background music.
• Auto language detection handles multilingual videos.

Use cases

• Video subtitles: Generate SRT/VTT subtitle files.
• Content indexing: Make video content searchable.
• Accessibility: Add captions to video content.
• Video editing: Find specific moments by transcript.
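For the subtitle use case, timestamped output can be turned into SRT cues with a few lines of code. The response shape is not documented on this page, so the sketch assumes a list of segments with `start` and `end` in seconds plus a `text` field; adjust the keys to whatever the API actually returns.

```python
from typing import Dict, List


def format_srt_time(seconds: float) -> str:
    """Render seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: List[Dict]) -> str:
    """Convert timestamped segments (assumed shape) into SRT text."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{format_srt_time(seg['start'])} --> {format_srt_time(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```

WebVTT differs mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds, so the same loop can be adapted for VTT output.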

Frequently Asked Questions

What is the OpenAI Whisper with Video API?

It is an API that transcribes the audio track of a video file into text, with optional timestamps.

How much does OpenAI Whisper with Video cost via API?

OpenAI Whisper with Video costs $0.006 per generation through Renderful's API; the pricing table in the README lists this rate per minute. No subscription is required, so you pay only for what you use.

How do I use OpenAI Whisper with Video via API?

Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "openai-whisper-with-video". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
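A request of this kind might be built as in the sketch below. Only the `/v1/predictions` path and the model id come from this page; the base URL, the bearer-token header, and the `input` wrapper around the parameters are assumptions, so check renderful.ai/docs for the real request format before relying on it.

```python
import json
import urllib.request
from typing import Optional

# Assumed base URL; only the endpoint path is stated on this page.
BASE_URL = "https://api.renderful.ai"


def build_prediction_request(api_key: str, video_url: str,
                             language: Optional[str] = None,
                             timestamps: bool = False) -> urllib.request.Request:
    """Build (but do not send) a POST request for a transcription."""
    model_input = {"video": video_url, "timestamps": timestamps}
    if language is not None:
        model_input["language"] = language
    # The "input" key is an assumed envelope for the model parameters.
    body = {"model": "openai-whisper-with-video", "input": model_input}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/predictions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # auth scheme is assumed
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Sending it would then be a matter of `urllib.request.urlopen(request)` and decoding the JSON body of the response.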

What type of content does OpenAI Whisper with Video generate?

OpenAI Whisper with Video is a speech-to-text model by OpenAI: it generates a text transcription of the speech in a video. Key features include video input, timestamps, and subtitle segments.

Is the OpenAI Whisper with Video API fast?

OpenAI Whisper with Video is listed as having fast generation speed, although no latency figures are given here. Results are delivered via polling or a webhook callback.
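The polling option could look roughly like the loop below. The status values `"succeeded"` and `"failed"` and the shape of the prediction object are assumptions, since this page does not document them; `fetch_status` stands in for whatever call retrieves the prediction.

```python
import time
from typing import Callable


def wait_for_result(fetch_status: Callable[[], dict],
                    interval: float = 2.0,
                    timeout: float = 300.0,
                    sleep: Callable[[float], None] = time.sleep) -> dict:
    """Poll until the prediction finishes, fails, or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        prediction = fetch_status()
        status = prediction.get("status")  # field name is assumed
        if status == "succeeded":
            return prediction
        if status == "failed":
            raise RuntimeError(f"prediction failed: {prediction}")
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not finish before the timeout")
        sleep(interval)
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays; a webhook callback avoids the loop entirely when the caller can expose an HTTP endpoint.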
