Live English speech translation API
Translate English speech to and from 60+ languages in real time. Low-latency streaming, 3,600 language pairs, and accurate translation for voice agents, live captioning, and multilingual apps.
Production-ready English speech translation API
Real conversations in English include accents, regional dialects, code switching, and domain-specific vocabulary. Soniox is built to handle that variation in a single model, so live translation stays usable across regions and registers.
Soniox preserves formatting in the translated transcript, including names, numbers, addresses, IDs, and domain-specific terms.
English is a West Germanic language in the Indo-European family. Soniox translates between English and related languages, as well as any of the 60+ supported languages, for 3,600 language pairs in total.
A breakthrough in real-time English translation
Translate English before the sentence ends
Meaning lands as it's spoken, not after the caption catches up.
English paired with 60+ languages
Both source and target. 3,600 pairs total, not English-centric.
High quality English output
Same model across every language, including historically underserved ones.
Native-speaker English STT accuracy
Accurate translation starts with accurate recognition across accents, multilingual speech, and language switching.
English names, numbers, and domain terms
Preserved in both transcript and translation, including phone numbers, emails, and IDs.
{
  "model": "stt-rt-v4",
  "translation": {
    "type": "one_way",
    "target_language": "en"
  }
}
Transcription + translation through a single stream
English translation is built on top of the Soniox Speech-to-Text API. Every spoken word is transcribed, translation streams mid-sentence, and both arrive together in a single labeled token stream.
Turn it on by adding a translation block to your request. Translation runs on the same WebSocket and the same model. Translated tokens come back interleaved with transcripts, with no extra round trip.
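As a rough sketch of what consuming that interleaved stream might look like, the Python below separates spoken-language tokens from translated ones. The token shape used here (a `text` field plus a `translation_status` field set to `"original"` or `"translation"`) is an assumption made for illustration and is not confirmed by this page; check the Soniox docs for the actual response schema.

```python
from typing import Iterable


def split_stream(tokens: Iterable[dict]) -> tuple[str, str]:
    """Split an interleaved token stream into (transcript, translation).

    Assumes each token is a dict with a "text" key and a
    "translation_status" key (hypothetical field names).
    """
    transcript_parts: list[str] = []
    translation_parts: list[str] = []
    for token in tokens:
        text = token.get("text", "")
        if token.get("translation_status") == "translation":
            translation_parts.append(text)
        else:
            # Untagged or "original" tokens are treated as spoken-language text.
            transcript_parts.append(text)
    return "".join(transcript_parts), "".join(translation_parts)


# Hand-written sample tokens, not real API output.
sample = [
    {"text": "Hola", "translation_status": "original"},
    {"text": " mundo", "translation_status": "original"},
    {"text": "Hello", "translation_status": "translation"},
    {"text": " world", "translation_status": "translation"},
]
transcript, translation = split_stream(sample)
print(transcript)   # Hola mundo
print(translation)  # Hello world
```

Because both kinds of token arrive on one connection, a single loop like this is enough to feed a caption view and a translation view side by side.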
Live English translation: written and spoken
English speech-to-text translation
Translate live English into written text with the Soniox STT API. Enable real-time translation with a config change. Soniox streams the English transcript and translated text as speech happens.
Use it for English captions, subtitles, meeting translation, agent assist, accessibility tools, and multilingual transcription.
English speech-to-speech translation
Build full spoken English translation by combining Soniox STT and Soniox TTS. Soniox recognizes English, translates it in real time, and speaks the output in the target language with low latency.
Use it for live English interpreters, bilingual voice agents, travel assistants, customer support, and real-time multilingual communication.
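One way to glue the translation stream to a speech synthesizer is to buffer translated text until a sentence is complete and then hand it off. The sketch below shows that idea only; `speak` is a stand-in for whatever TTS call you use, and sentence-level buffering is a design choice assumed here, not something this page describes. It trades a little latency for more natural prosody.

```python
from typing import Callable, Iterable

SENTENCE_END = (".", "!", "?")


def speak_translated(chunks: Iterable[str], speak: Callable[[str], None]) -> None:
    """Buffer translated text and call `speak` once per complete sentence."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        if buffer.rstrip().endswith(SENTENCE_END):
            speak(buffer.strip())
            buffer = ""
    if buffer.strip():
        # Flush whatever is left when the stream ends.
        speak(buffer.strip())


# Collect the "spoken" sentences in a list instead of calling a real TTS engine.
spoken: list[str] = []
speak_translated(["Hola", " mundo.", " ¿Cómo", " estás?"], spoken.append)
print(spoken)  # ['Hola mundo.', '¿Cómo estás?']
```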
Live English translation for captions, conversations and more
Stream live English translation in two modes: one-way translates all speech into a single target language, while two-way keeps a bilingual conversation flowing between English and another language.
One-way translation
Translate live speech, to or from English, into a single target language. Everyone in the conversation sees the same translated stream.
Ideal for live captions, multilingual meetings, broadcasts, lectures, customer calls, and products where many speakers need to be understood in one language.
Two-way translation
Translate between English and another language for live bilingual conversation. Each side speaks naturally and hears the other in their own language.
Soniox supports real-time two-way translation between any two of 60+ supported languages.
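The two modes could be expressed as two small config builders. The one-way shape mirrors the sample request shown earlier on this page; the two-way shape (`"two_way"`, `language_a`, `language_b`) is an assumed naming used for illustration, so confirm the real field names in the Soniox docs before relying on it.

```python
import json


def one_way(target_language: str, model: str = "stt-rt-v4") -> dict:
    """Config shape taken from the sample request on this page."""
    return {
        "model": model,
        "translation": {"type": "one_way", "target_language": target_language},
    }


def two_way(language_a: str, language_b: str, model: str = "stt-rt-v4") -> dict:
    """Hypothetical two-way shape; the type and field names are assumptions."""
    if language_a == language_b:
        raise ValueError("two-way translation needs two different languages")
    return {
        "model": model,
        "translation": {
            "type": "two_way",
            "language_a": language_a,
            "language_b": language_b,
        },
    }


print(json.dumps(one_way("en"), indent=2))
print(json.dumps(two_way("en", "es"), indent=2))
```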
Built on the Soniox speech AI platform
English translation is powered by the same infrastructure behind Soniox STT and TTS.
Speech-to-Text
Native-speaker accuracy across 60+ languages, with support for multilingual speech, alphanumerics, speaker diarization, and context.
Translation
Real-time streaming translation across 3,600 language pairs, built for high quality and low delay across all supported languages.
Text-to-speech
High-fidelity speech generation in 60+ languages, built for names, alphanumerics, language switching, and ultra-low-latency streaming.
Together, they create a complete real-time low-latency speech AI platform.
About English
English has roughly 1.5 billion speakers across the United States, the United Kingdom, Canada, Australia, New Zealand, Ireland, and 9 more regions.
English is the most widely spoken language in the world when including both native and non-native speakers.
English has almost no grammatical gender and minimal verb inflection compared to other Indo-European languages.
Translate between all languages
From English to any supported language, or any supported language to English. 3,600 language pairs, all real-time.
Frequently asked questions
Does Soniox translate English in real time?
What language pairs are supported for English?
How does two-way English translation work?
Does Soniox handle dialects and accents in English?
Can Soniox translate names, numbers, and domain terms in English?
Which other providers support live English translation?
How fast is English translation?
What can I build with English translation?
Ready to get started?
Create an account instantly, or contact us to design a custom package for your business.
Build with API
Documentation
Get up and running in minutes and spend your time building the product, not wrestling with the API.
Explore docs
See what you’ll pay
Pay only for what you use with our flexible pricing. Built to scale with you.
Pricing details