Fair, flexible pricing.
Built to scale with you.
Pay only for what you use. Get started with $200 in free API credits.
Simple rates for transcription
- $0.10/hour for async (file uploads)
- $0.12/hour for real-time (streaming)
Both prices are estimates based on typical usage and include audio + text tokens.
Token-based pricing for advanced use cases
If you’re building multilingual apps, using custom context, or need fine-grained control, we calculate costs based on tokens.
Async
Real-time
Input audio tokens
$1.50 per 1M tokens
$2.00 per 1M tokens
Duration of audio or streaming session
Input text tokens
$1.50 per 1M tokens
$2.00 per 1M tokens
Custom instructions or context you provide (docs)
Output text tokens
$3.50 per 1M tokens
$4.00 per 1M tokens
Transcription and optionally translation or other text returned by the model
Usage reference
- 1 hour of audio is ~30,000 input audio tokens
- 1 hour of speech is ~15,000 output text tokens
- 1 character of output is ~0.3 tokens
No surprises
Billing is based on actual usage. Nothing more.
- $200 in free API credits
- No minimums
- No tiered pricing
- No lock-in
Custom pricing available
Processing millions of minutes per month or need a custom license? We offer tailored plans to fit your needs.