Get started
Learn how to generate speech with the Soniox Text-to-Speech API.
Soniox Text-to-Speech is built for the hardest parts of speech generation. It delivers native-speaker-quality speech in 60+ languages, with hallucination-free output and accurate pronunciation of alphanumerics such as phone numbers, email addresses, and IDs.
Soniox TTS is optimized for ultra-low latency and can start generating speech from the first few words, before the full sentence is available. It is available through WebSocket streaming and request-response generation over REST.
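To make the idea of generating speech before the full sentence is available more concrete, here is a minimal client-side sketch that groups incoming text fragments (for example, tokens from a language model) into small chunks of whole words. The function name `words_to_chunks` and the `min_words` parameter are hypothetical, and cutting only at word boundaries is a cautious assumption rather than a documented requirement. How each chunk is actually sent to Soniox is not shown here; see the WebSocket documentation for that.

```python
from typing import Iterable, Iterator


def words_to_chunks(tokens: Iterable[str], min_words: int = 3) -> Iterator[str]:
    """Group incoming text fragments into small chunks of whole words.

    Hypothetical helper: it only prepares text on the client side and does
    not call any Soniox API.
    """
    buffer = ""
    for token in tokens:
        buffer += token
        # Cut only at the last space so a partial word is never emitted.
        cut = buffer.rfind(" ")
        if cut == -1:
            continue
        ready, rest = buffer[:cut], buffer[cut + 1:]
        # Emit as soon as a few complete words have accumulated.
        if len(ready.split()) >= min_words:
            yield ready
            buffer = rest
    # Flush whatever remains once the input ends.
    if buffer.strip():
        yield buffer.strip()


fragments = ["Your co", "de is 4", "8 21 and", " expires ", "soon."]
chunks = list(words_to_chunks(fragments))
```

Each yielded chunk could be forwarded to a streaming connection as soon as it is produced, instead of waiting for the whole sentence.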
Use this guide to run your first Text-to-Speech request.
Get API key
Create a Soniox account and log in to the Console to get your API key.
API keys are created per project. In the Console, go to My First Project and click API Keys to generate one.
Export your key as an environment variable so that the examples can read it without hard-coding it.
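As a small illustration of consuming that variable, the sketch below reads the key in Python and fails early with a clear message if it is missing. The variable name `SONIOX_API_KEY` and the helper `load_api_key` are assumptions made for this example; use whatever name you exported.

```python
import os
from typing import Mapping

# Assumed variable name; change it if you exported the key under another name.
API_KEY_ENV_VAR = "SONIOX_API_KEY"


def load_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the API key from the environment, or raise if it is not set."""
    key = env.get(API_KEY_ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(
            f"{API_KEY_ENV_VAR} is not set. Export your API key before running the examples."
        )
    return key
```

Calling `load_api_key()` at the start of a script keeps the key out of source code and surfaces a missing key before any request is made.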
Run examples
Next steps
- Real-time generation over WebSocket — streaming audio as text is sent.
- REST API — single request/response calls.