Live English speech translation API

Translate English speech to and from 60+ languages in real time. Low-latency streaming, 3,600 language pairs, and accurate translation for voice agents, live captioning, and multilingual apps.

Trusted by teams building global voice products

Production-ready English speech translation API

Real conversations in English include accents, regional dialects, code switching, and domain-specific vocabulary. Soniox is built to handle that variation in a single model, so live translation stays usable across regions and registers.

Soniox preserves formatting in the translated transcript, including names, numbers, addresses, IDs, and domain-specific terms.

English belongs to the Indo-European > Germanic > West Germanic family. Soniox translates between English and related languages, as well as any of 60+ supported targets and 3,600 language pairs in total.

A breakthrough in real-time English translation

config.json
{
  "model": "stt-rt-v5",
  "translation": {
    "type": "one_way",
    "target_language": "en"
  }
}

Translate English before the sentence ends

Meaning lands as it's spoken, not after the caption catches up.

English paired with 60+ languages

Both source and target. 3,600 pairs total, not English-centric.

High quality English output

Same model across every language, including historically underserved ones.

Native-speaker English STT accuracy

Accurate translation starts with accurate recognition across accents, multilingual speech, and language switching.

English names, numbers, and domain terms

Preserved in both transcript and translation, including phone numbers, emails, and IDs.

English translation is built on top of the Soniox Speech-to-Text API. Turn it on by adding a translation block to your request. Translated tokens come back interleaved with transcripts on the same stream, with no extra round trip.

Live English translation: written and spoken

Live Transcription
SpanishTranscript

EnglishTranslation

English speech-to-text translation

Translate live English into written text with the Soniox STT API. Enable real-time translation with a config change. Soniox streams the English transcript and translated text as speech happens.

Use it for English captions, subtitles, meeting translation, agent assist, accessibility tools, and multilingual transcription.

Realtime Translator
HarutoJapanese
Speaking

EmmaEnglish

English speech-to-speech translation

Build full spoken English translation by combining Soniox STT and Soniox TTS. Soniox recognizes English, translates it in real time, and speaks the output in the target language with low latency.

Use it for live English interpreters, bilingual voice agents, travel assistants, customer support, and real-time multilingual communication.

One-way or two-way English translation

One-way translates all speech into a single target language. Two-way keeps a bilingual conversation flowing between English and another language, so each side speaks naturally and hears the other in their own.

YouTube

One-way translation

Translate live speech to or from English into a single target language. Everyone in the conversation sees the same translated stream.

Ideal for live captions, multilingual meetings, broadcasts, lectures, customer calls, and products where many speakers need to be understood in one language.

Live Conversation
Sofía

Two-way translation

Translate between English and another language for live bilingual conversation. Each side speaks naturally and hears the other in their own language.

Soniox supports real-time two-way translation between any two of 60+ supported languages.

Simple, usage-based pricing

Start translating live English audio streams from ~$0.18/hour.

Translation is already built into Soniox Speech-to-Text API. When turned on, it adds about ~$0.06/hour in output token costs.

About English

English has roughly 1,500,000,000 speakers across United States, United Kingdom, Canada, Australia, New Zealand, and Ireland, and 9 more regions.

English is the most widely spoken language in the world when including both native and non-native speakers.

English has almost no grammatical gender and minimal verb inflection compared to other Indo-European languages.

Frequently asked questions

Does Soniox translate English in real time?
Yes. Soniox streams English translation while speech is still happening, so users see or hear meaning immediately instead of waiting for the sentence to end.
What language pairs are supported for English?
Soniox supports translation between English and any of 60+ supported languages — 3,600 language pairs in total. Both English as the source and English as the target are supported.
How does two-way English translation work?
Two-way translation lets both sides of a conversation speak in their own language and hear the other side in theirs. A English speaker can talk in English while the other party hears their own language in real time, and vice versa.
Does Soniox handle dialects and accents in English?
Yes. Soniox handles English dialects like American English, British English, and Australian English in a single model, along with accents and language switching, so live translation stays accurate across regions and registers.
Can Soniox translate names, numbers, and domain terms in English?
Yes. Soniox preserves the details that matter — names, phone numbers, emails, IDs, addresses, verification codes, and domain-specific terminology — in both the English transcript and the translated output.
Which other providers support live English translation?
Based on their public docs, OpenAI, Google, Azure, and Speechmatics list English as a supported source language for real-time translation. Soniox is the only one of them that also supports two-way live translation across 60+ languages.
How fast is English translation?
Soniox streams translated text as English is being spoken, with ultra-low latency. Translation arrives before the sentence ends, rather than after long caption delays.
What can I build with English translation?
Common use cases include multilingual voice agents, live interpreters, multilingual meetings, customer support and contact centers, real-time translated captions and subtitles, and accessibility and communication tools — all with English as a source or target.

Ready to get started?

Create an account instantly, or contact us to design a custom package for your business.

Build with API

Documentation

Get up and running in minutes and spend your time building, not wrestling with the API.

Explore docs

See what you’ll pay

Pay only for what you use with our flexible pricing. Built to scale with you.

Pricing details