Customization
Learn how to use custom context to enhance trancription accuracy.
Overview
Soniox Speech-to-Text AI allows you to enhance transcription accuracy by providing custom context for each transcription session. This feature is especially useful when working with:
- Industry-specific terminology
- Brand names or product names
- Uncommon names or made-up words
- Domain-specific documents or phrases
By providing context, you help the AI model better understand and anticipate the language in your audio — even if some terms do not appear clearly or completely.
How context works
The context
parameter accepts any text that may be relevant to the transcription session. This text
is not required to appear in the audio — it simply acts as guidance for the model to improve
recognition accuracy when necessary.
The model uses the provided context only when helpful, and it does not override normal speech recognition behavior.
Supported context types
You can supply many types of text to the context
parameter, such as:
List of terms or keywords
Useful for proper nouns, technical vocabulary, or product names:
Full text or summary
Provide a paragraph, summary, or reference document related to the audio content:
Context size limit
- The
context
can contain up to 8,000 tokens (roughly 10,000+ characters) - This allows you to include substantial information, including summaries, scripts, or glossary-style entries
If the context exceeds the limit, the API will return an error — be sure to trim or summarize as needed.
Best practices
- Use commas or spacing to separate terms in short lists
- Keep context relevant to the session — don't overload with unrelated data
- Preprocess content from related documents (e.g., transcripts, emails, product info) into a clean context block
Use cases
Use case | Example context |
---|---|
Medical transcription | Medication names, procedure terms, doctor/patient names. |
Call center recordings | Customer name, agent info, company-specific lingo. |
Industry-specific jargon | Terms from legal, finance, biotech, or tech domains. |
Podcasts / interviews | Guest names, brand mentions, episode summaries. |
Custom words and neologisms | Fictional terms, product names, made-up branding. |
Example: Custom word recognition
The following example demonstrates how to transcribe audio containing words
Celebrex, Zyrtec, Xanax, Prilosec, Amoxicillin Clavulanate Potassium
by
including them in the context: