Question 1

How much does the Soniox API cost?

Accepted Answer

Speech-to-Text is $0.10/hour for async (file uploads) and $0.12/hour for real-time (streaming). Advanced use cases with translation, custom context, or fine-grained control are billed by token usage. Text-to-Speech is token-based, about $0.70/hour of generated speech. Use the calculator above to estimate your spend.

Question 2

How does Soniox compare to Google, Azure, and OpenAI on price?

Accepted Answer

Soniox real-time is $0.12/hour. Google Speech-to-Text V2 starts around $0.96/hour and Azure Speech around $1.00/hour for real-time, so Soniox is roughly 8x less. OpenAI has no native streaming and starts at $0.36/hour for batch transcription.

Question 3

Is translation included, or billed separately?

Accepted Answer

Included. Soniox transcribes and translates across 60+ languages in the same real-time API call at no extra cost. OpenAI, Google, and Azure bill translation as a separate service (Azure’s add-on alone is about $2.50/hour).

Question 4

Is Soniox cheaper than Deepgram?

Accepted Answer

Yes. Soniox is $0.10–0.12/hour, while Deepgram Nova-3 with comparable add-ons (keyterms, diarization) runs about $0.39–0.55/hour, roughly 4–5x more. See the full breakdown on our Soniox vs Deepgram page.

Question 5

Do I pay extra for diarization, language detection, or formatting?

Accepted Answer

No. Speaker diarization, language identification, and smart formatting are bundled into the hourly rate. Most providers charge these as add-ons, for example Deepgram diarization adds about $0.12/hour and Azure real-time diarization about $0.30/hour.

Question 6

What is the difference between real-time and async pricing?

Accepted Answer

Real-time streaming is $0.12/hour and async file transcription is $0.10/hour. Both run the same model with the same accuracy and features.

Question 7

How much does Soniox Text-to-Speech cost?

Accepted Answer

Text-to-Speech is token-based: $4.00 per 1M input text tokens and $21.50 per 1M output audio tokens, about $0.70 per hour of generated speech.

Fair, flexible pricing.
Built to scale with you.

Stop overpaying for speech AI

Speech-to-Text API pricing

Token-based pricing

Text-to-Speech API pricing

Token-based pricing

Breakthrough innovation is why Soniox costs less

Built for real-time speech AI

Custom inference engine

Massive concurrency

Frequently asked questions

Ready to get started?

Documentation

See what you’ll pay

Fair, flexible pricing.Built to scale with you.

Stop overpaying for speech AI

Speech-to-Text API pricing

Token-based pricing

Text-to-Speech API pricing

Token-based pricing

Breakthrough innovation is why Soniox costs less

Built for real-time speech AI

Custom inference engine

Massive concurrency

Frequently asked questions

Ready to get started?

Documentation

See what you’ll pay

Fair, flexible pricing.
Built to scale with you.