Models / Languages
We offer speech recognition models for various languages and domains. However, we offer only high accuracy speech recognition solutions, meaning that you should be able to transcribe any audio in supported languages/domains and obtain a highly accurate transcription. For comparison with other providers, please see Benchmark Data.
All the models also support real-time and low-latency speech recognition.
You should always use the General domain model, unless your use case comes from Medical or IVR domain. At the moment, Medical and IVR models are supported only with the English language.
Language | Domain | Config | Speaker AI Supported |
English | General | en_precision | Yes |
Medical | en_precision_medical | Yes | |
IVR | en_precision_ivr | Yes | |
Spanish | General | es_precision | No |
German | General | de_precision | No |
You can configure which model to use by setting the model
parameter to one of the supported values (e.g. en_precision_medical).
The model
parameter can be found in the transcribe functions in the client libraries or in the TranscriptionConfig.