Language hints
Enhance recognition accuracy by providing the language hints parameter.
Soniox Speech-to-Text AI is a single model that transcribes speech in over 50 languages with extreme accuracy. There is no need to pre-select a language — Soniox STT automatically detects and transcribes speech in any of the supported languages. Additionally, it can seamlessly handle multilingual audio, even when different languages are mixed within a single recording.
If you expect the audio to contain a specific language or set of languages, you can provide language hints using the language hints parameter. This guides the AI model toward the expected language(s), which may further enhance recognition accuracy.
If no language hints are specified, the AI model will automatically detect and transcribe all spoken languages within the audio, whether it's a single language or multiple languages mixed together.
See the list of languages supported by Soniox Speech-to-Text AI.
Examples
The language_hints parameter, which contains a list of language codes, can be passed in the payload when calling the Create Transcription API endpoint.
Example payload that hints towards English and Spanish:
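A minimal sketch of such a payload, built in Python and serialized to JSON. Only language_hints comes from the text above. The "en" and "es" codes assume two-letter language codes, and the model and audio_url fields and their values are illustrative placeholders, so check the Create Transcription API reference for the exact schema.

```python
import json


def build_payload(audio_url, language_hints, model="example-model"):
    """Build a request payload that carries language hints.

    `model` and `audio_url` are assumed field names used for illustration;
    `language_hints` is the parameter described in this page.
    """
    return {
        "model": model,                          # placeholder model name (assumption)
        "audio_url": audio_url,                  # placeholder audio location (assumption)
        "language_hints": list(language_hints),  # e.g. English and Spanish
    }


payload = build_payload("https://example.com/audio.mp3", ["en", "es"])
print(json.dumps(payload, indent=2))
```

Passing an empty list, or leaving the field out, would correspond to the default behavior described above, where the model detects the spoken languages on its own.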
Code example using language hints:
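The sketch below shows one way to send language hints using only the Python standard library. The endpoint URL, the Bearer-token header, and the model and audio_url fields are assumptions made for illustration and are not confirmed by this page. Only the language_hints parameter is taken from the text above.

```python
import json
import urllib.request

# Assumed endpoint for the Create Transcription API; verify against the API reference.
ASSUMED_ENDPOINT = "https://api.soniox.com/v1/transcriptions"


def build_transcription_request(api_key, audio_url, language_hints,
                                model="example-model", endpoint=ASSUMED_ENDPOINT):
    """Prepare (but do not send) a POST request carrying language hints."""
    body = {
        "model": model,                          # placeholder model name (assumption)
        "audio_url": audio_url,                  # assumed field name
        "language_hints": list(language_hints),  # parameter described in this page
    }
    return urllib.request.Request(
        endpoint,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def create_transcription(request):
    """Send the prepared request and return the decoded JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Build a request that hints towards English and Spanish.
request = build_transcription_request(
    api_key="YOUR_API_KEY",
    audio_url="https://example.com/audio.mp3",
    language_hints=["en", "es"],
)
# To actually submit it: result = create_transcription(request)
```

Keeping request construction separate from sending makes it possible to inspect the payload before any network call is made.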