Get started
Learn how to use Soniox Speech-to-Text API.
Learn how to use Soniox Speech-to-Text API in minutes
Soniox Speech-to-Text is a universal speech AI that lets you transcribe speech in 60+ languages — from recorded files (async) or live audio streams (real-time). Languages can be freely mixed within the same conversation, and Soniox will handle them seamlessly with high accuracy and low latency.
In just a few steps, you can run your first transcription. The examples also cover real-time and async transcription flows through the same simple API.
Get API key
Create a Soniox account and log in to the Console to get your API key.
API keys are created per project. In the Console, go to My First Project and click API Keys to generate one.
Export it as an environment variable (replace with your key):
Run examples
Choose your language and run the ready-to-use examples below.
Example | What it does | Output |
|---|---|---|
| Real-time transcription | Transcribes speech in any language in real-time. | Transcript streamed to console. |
| Transcribe file from URL | Transcribes an audio file directly from a public URL. | Transcript printed to console. |
| Transcribe local file | Uploads and transcribes an audio file from your computer. | Transcript printed to console. |
Next steps
- Dive into the Real-time API → Run live transcription and endpoint detection.
- Explore the Async API → Transcribe recorded files at scale and integrate with webhooks.