Get started
Learn how to generate speech with the Soniox Text-to-Speech API.
Learn how to use the Soniox Text-to-Speech API in minutes
Soniox Text-to-Speech is built for the hardest parts of speech generation. It delivers native-speaker-quality speech in 60+ languages, with hallucination-free output and accurate pronunciation of alphanumerics such as phone numbers, email addresses, and IDs.
Soniox TTS is optimized for ultra-low latency and can start generating speech from the first few words, before the full sentence is available. It is available through WebSocket streaming and request-response generation over REST.
Use this guide to run your first Text-to-Speech request.
Get API key
Create a Soniox account and log in to the Console to get your API key.
API keys are created per project. In the Console, go to My First Project and click API Keys to generate one.
Export it as an environment variable (replace with your key):
Run examples
Next steps
- Dive into the Real-time API → Stream audio as text arrives. Ideal for voice agents and LLM-driven applications.
- Explore the REST API → Generate full audio files in a single request. Ideal for server-side and batch generation.