Fair, flexible pricing.
Built to scale with you.

Pay only for what you use. Get started with $200 in free API credits.

Simple rates for transcription

  • $0.10/hour for async (file uploads)
  • $0.12/hour for real-time (streaming)

Both rates are estimates based on typical usage and cover both audio and text tokens.

Token-based pricing for advanced use cases

If you’re building multilingual apps, using custom context, or looking for fine-grained control, costs are calculated per token.

Token type           Async                 Real-time             What it covers
Input audio tokens   $1.50 per 1M tokens   $2.00 per 1M tokens   Duration of audio or streaming session
Input text tokens    $1.50 per 1M tokens   $2.00 per 1M tokens   Custom instructions or context you provide (docs)
Output text tokens   $3.50 per 1M tokens   $4.00 per 1M tokens   Transcription and optionally translation or other text returned by the model
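
As a rough illustration of how the per-token rates above combine, here is a minimal Python sketch. The function name `token_cost`, the dictionary layout, and the sample token counts are assumptions made for this example; they are not part of any official SDK or API.

```python
# Rates in USD per 1M tokens, copied from the pricing listed above.
RATES_PER_MILLION = {
    "async": {"input_audio": 1.50, "input_text": 1.50, "output_text": 3.50},
    "real-time": {"input_audio": 2.00, "input_text": 2.00, "output_text": 4.00},
}


def token_cost(mode, input_audio=0, input_text=0, output_text=0):
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper for illustration only.
    """
    rates = RATES_PER_MILLION[mode]
    counts = {
        "input_audio": input_audio,
        "input_text": input_text,
        "output_text": output_text,
    }
    # Multiply each token count by its rate, then scale from "per 1M" to "per token".
    return sum(counts[kind] * rates[kind] for kind in counts) / 1_000_000


# Made-up async job: 60,000 audio tokens, 2,000 context tokens, 30,000 output tokens.
cost = token_cost("async", input_audio=60_000, input_text=2_000, output_text=30_000)
print(f"${cost:.4f}")  # prints $0.1980
```

Keeping the rates in one lookup table means a change to any single rate touches only one line.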

Usage reference

  • 1 hour of audio is ~30,000 input audio tokens
  • 1 hour of speech is ~15,000 output text tokens
  • 1 character of output is ~0.3 tokens
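
The reference figures above can be turned into a quick per-hour estimate. The sketch below is an illustration under stated assumptions: the audio is continuous speech, no input text tokens are sent, and the helper names (`hourly_cost`, `tokens_from_characters`) are invented for this example.

```python
# Approximate conversion factors from the usage reference above.
AUDIO_TOKENS_PER_HOUR = 30_000   # input audio tokens per hour of audio
OUTPUT_TOKENS_PER_HOUR = 15_000  # output text tokens per hour of speech
TOKENS_PER_OUTPUT_CHAR = 0.3     # output tokens per character of output

# (input audio rate, output text rate) in USD per 1M tokens, from the pricing table.
RATES = {"async": (1.50, 3.50), "real-time": (2.00, 4.00)}


def hourly_cost(mode, hours=1.0):
    """Estimate USD cost for `hours` of continuous speech (assumption: no input text)."""
    audio_rate, output_rate = RATES[mode]
    audio_tokens = hours * AUDIO_TOKENS_PER_HOUR
    output_tokens = hours * OUTPUT_TOKENS_PER_HOUR
    return (audio_tokens * audio_rate + output_tokens * output_rate) / 1_000_000


def tokens_from_characters(n_chars):
    """Estimate output text tokens from a character count."""
    return round(n_chars * TOKENS_PER_OUTPUT_CHAR)


print(f"async:     ${hourly_cost('async'):.4f} per hour")      # $0.0975
print(f"real-time: ${hourly_cost('real-time'):.4f} per hour")  # $0.1200
print(tokens_from_characters(1_000))                           # 300
```

Under these assumptions the results line up with the simple rates: about $0.10 per hour for async and $0.12 per hour for real-time. Audio with long silences or added custom context would shift the estimate.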

No surprises

Billing is based on actual usage. Nothing more.

  • $200 in free API credits
  • No minimums
  • No tiered pricing
  • No lock-in

Custom pricing available

Processing millions of minutes per month or need a custom license? We offer tailored plans to fit your needs.