Audio formats
Supported audio formats for Soniox Text-to-Speech.
Overview
This page lists the supported audio_format values for Soniox Text-to-Speech, together with the sample rates and bitrates each format accepts.
Supported formats
pcm_f32le, pcm_s16le, pcm_mulaw, pcm_alaw, wav, aac, mp3, opus, flac
Raw PCM details
For PCM output, use one of the raw PCM encodings and set:
- audio_format: the encoding type (pcm_f32le, pcm_s16le, pcm_mulaw, or pcm_alaw)
- sample_rate: output sample rate in Hz
Example:
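A minimal sketch in Python, assuming the format settings are passed as a plain dictionary. The helper name pcm_output_options is hypothetical; only the audio_format and sample_rate field names and the four encoding names come from this page. How the dictionary is sent to the service is not shown.

```python
# Raw PCM encodings listed on this page.
RAW_PCM_FORMATS = {"pcm_f32le", "pcm_s16le", "pcm_mulaw", "pcm_alaw"}


def pcm_output_options(audio_format: str, sample_rate: int) -> dict:
    """Hypothetical helper: build the format-related settings for raw PCM output."""
    if audio_format not in RAW_PCM_FORMATS:
        raise ValueError(f"not a raw PCM encoding: {audio_format!r}")
    return {"audio_format": audio_format, "sample_rate": sample_rate}


# 16-bit little-endian PCM at 24 kHz.
options = pcm_output_options("pcm_s16le", 24000)
print(options)
```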
Compressed format details
For compressed formats (mp3, opus, aac), you can also set:
- bitrate: codec bitrate in bits per second (bps)
Example:
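A minimal sketch in Python, assuming the same plain-dictionary shape for the settings. The helper name compressed_output_options is hypothetical; the field names audio_format, sample_rate, and bitrate come from this page. The bitrate is left out of the dictionary when it is not given, on the assumption that the service then applies its default.

```python
# Compressed formats that accept a bitrate, per this page.
COMPRESSED_FORMATS = {"mp3", "opus", "aac"}


def compressed_output_options(audio_format, sample_rate, bitrate=None):
    """Hypothetical helper: build the format-related settings for compressed output."""
    if audio_format not in COMPRESSED_FORMATS:
        raise ValueError(f"not a compressed format: {audio_format!r}")
    options = {"audio_format": audio_format, "sample_rate": sample_rate}
    if bitrate is not None:
        # bitrate is expressed in bits per second, e.g. 128000 for 128 kbps
        options["bitrate"] = bitrate
    return options


# MP3 at 44.1 kHz and 128 kbps.
mp3_options = compressed_output_options("mp3", 44100, bitrate=128000)
print(mp3_options)
```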
Full format reference
Supported sample rates and bitrates for all formats
Defaults are shown in bold.
| Format | Sample rates (Hz) | Bitrates (bps) |
|---|---|---|
| pcm_f32le | 8000, 16000, 24000, 44100, 48000 | - |
| pcm_s16le | 8000, 16000, 24000, 44100, 48000 | - |
| pcm_mulaw | 8000 | - |
| pcm_alaw | 8000 | - |
| wav | 8000, 16000, 24000, 44100, 48000 | - |
| flac | 16000, 24000, 44100, 48000 | - |
| mp3 | 16000, 24000, 32000, 44100, 48000 | 32000, 64000, 96000, 128000, 192000, 256000, 320000 |
| opus | 8000, 16000, 24000, 48000 | 16000, 32000, 64000, 96000, 128000, 256000 |
| aac | 16000, 24000, 44100, 48000 | 32000, 64000, 96000, 128000, 192000, 256000, 320000 |
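
The table can be turned into a client-side check before a request is sent. The following Python sketch copies the values from the table; the names FORMAT_TABLE and check_audio_options are hypothetical, and treating a bitrate on a format with no listed bitrates as a problem is an assumption about how the service behaves, not something this page states.

```python
# Sample rates (Hz) and bitrates (bps) copied from the table above.
# An empty bitrate tuple means the table lists no bitrates for that format.
COMMON_RATES = (8000, 16000, 24000, 44100, 48000)
MP3_AAC_BITRATES = (32000, 64000, 96000, 128000, 192000, 256000, 320000)

FORMAT_TABLE = {
    "pcm_f32le": (COMMON_RATES, ()),
    "pcm_s16le": (COMMON_RATES, ()),
    "pcm_mulaw": ((8000,), ()),
    "pcm_alaw": ((8000,), ()),
    "wav": (COMMON_RATES, ()),
    "flac": ((16000, 24000, 44100, 48000), ()),
    "mp3": ((16000, 24000, 32000, 44100, 48000), MP3_AAC_BITRATES),
    "opus": ((8000, 16000, 24000, 48000),
             (16000, 32000, 64000, 96000, 128000, 256000)),
    "aac": ((16000, 24000, 44100, 48000), MP3_AAC_BITRATES),
}


def check_audio_options(audio_format, sample_rate=None, bitrate=None):
    """Return a list of problems; an empty list means the combination is in the table."""
    if audio_format not in FORMAT_TABLE:
        return [f"unsupported audio_format: {audio_format}"]
    rates, bitrates = FORMAT_TABLE[audio_format]
    problems = []
    if sample_rate is not None and sample_rate not in rates:
        problems.append(f"{audio_format} does not list sample_rate {sample_rate}")
    if bitrate is not None:
        if not bitrates:
            # Assumption: formats without listed bitrates take no bitrate setting.
            problems.append(f"{audio_format} does not list any bitrates")
        elif bitrate not in bitrates:
            problems.append(f"{audio_format} does not list bitrate {bitrate}")
    return problems


print(check_audio_options("mp3", sample_rate=44100, bitrate=128000))
print(check_audio_options("pcm_mulaw", sample_rate=16000))
```

Values omitted by the caller (None) are not checked, since the service is expected to fall back to its defaults for them.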