Models

Soniox Text-to-Speech is built for the hardest parts of speech generation. It delivers native-speaker-quality speech in 60+ languages, with hallucination-free output and accurate pronunciation of alphanumerics such as phone numbers, email addresses, and IDs.

This page lists the currently available models, along with release notes and important updates.

Current models

Model	Type	Status
tts-rt-v1	Real-time	Active

Aliases

Aliases provide a stable reference so you don’t need to change your code when newer versions are released.

Alias	Points to	Notes
tts-rt-v1-preview	`tts-rt-v1`

Changelog

April 29, 2026

Overview

Soniox TTS is now generally available.

The preview model tts-rt-v1-preview is now available as the production model tts-rt-v1.
tts-rt-v1 is available to all API customers and deployed in all Soniox regions: US, EU, and JP.
For backward compatibility, tts-rt-v1-preview now points to tts-rt-v1 with no service interruption. We recommend updating your API requests to use tts-rt-v1.

April 23, 2026

Overview

tts-rt-v1-preview is the first Soniox Text-to-Speech model, released in preview to gather developer feedback and guide further improvements before general availability.

Key capabilities

Native-speaker-quality speech in 60+ languages
Hallucination-free generation, with no invented words, dropped content, or unexpected substitutions
Accurate rendering of alphanumerics such as email addresses, phone numbers, street addresses, IDs, and codes
Streaming generation before the sentence ends for ultra-low-latency voice systems
Multiple voices that work across all supported languages
Configurable audio formats, sample rates, and bitrates
Support for both WebSocket and REST APIs

Models

On this page