New: Soniox v5 Real-Time is here

Persian to English speech translation API

Stream Persian (فارسی) speech and get English (English) back in real time. One WebSocket, ISO codes fa to en, and ultra-low latency for voice agents and live apps.

Trusted by teams building global voice products

Production-ready Persian to English translation API

Real Persian speech includes accents, regional dialects, code switching, and domain-specific vocabulary. Soniox recognizes it in a single model and streams English while the speaker is still talking.

Persian (Indo-European > Iranian > Western Iranian) and English (Indo-European > Germanic > West Germanic) come from different language families, so word order and morphology differ. Soniox reorders meaning in-stream instead of word by word.

Persian is written in Arabic and English in Latin, so Soniox emits correctly scripted English text. Right-to-left output is handled directly in the token stream.

A breakthrough in real-time Persian to English translation

Translate before the sentence ends

English meaning lands as Persian is spoken, not after the caption catches up.

Directional fa to en streaming

Set the source and target codes once. Both arrive in a single labeled token stream.

High quality English output

Same model across every language, including historically underserved ones.

Native-speaker Persian STT accuracy

Accurate English translation starts with accurate Persian recognition across accents and language switching.

Names, numbers, and domain terms

Preserved across the pair, including phone numbers, emails, and IDs.

config.json
{
  "model": "stt-rt-v5",
  "translation": {
    "type": "one_way",
    "source_language": "fa",
    "target_language": "en"
  }
}

Persian and English through a single stream

Persian to English translation is built on top of Soniox Speech-to-Text API. Every spoken word is transcribed, and English translation streams mid-sentence in the same labeled token stream.

Turn it on by adding a translation block with source_language: "fa" and target_language: "en". It runs on the same WebSocket and the same model, with no extra round trip.

Live Persian to English: written and spoken

Persian to English speech-to-text

Live Persian speechEnglish text

Translate live Persian into written English with the Soniox STT API. Soniox streams the Persian transcript and the English translation as speech happens.

Use it for English captions, subtitles, meeting translation, agent assist, and multilingual transcription.

Persian to English speech-to-speech

Live Persian speechSpoken English

Build full spoken Persian to English translation by combining Soniox STT and Soniox TTS. Soniox recognizes Persian, translates it, and speaks English with low latency.

Use it for live interpreters, bilingual voice agents, travel assistants, and customer support.

Live Persian to English translation in action

Stream Persian to English one-way to push all speech into English, or two-way to keep a bilingual conversation flowing between the two languages.

Persian speaker says: فقط من یه meeting آنلاین ساعت چهار دارم؛ اگه زودتر بریم، OK ام.
Translated into English in real time.

One-way translation

Translate live Persian into English. Everyone in the conversation sees the same translated stream.

Ideal for live captions, multilingual meetings, broadcasts, lectures, and customer calls.

Persian speaker talks in Persian.
English speaker hears English.
English speaker replies in English.
Persian speaker hears Persian.

Two-way translation

Translate between Persian and English for live bilingual conversation. Each side speaks naturally and hears the other in their own language.

Soniox supports real-time two-way translation between any two of 60+ supported languages.

Accurate on both ends of the pair

Soniox transcribes Persian at 1.25% word error rate and English at 1.25% word error rate. Accurate recognition on both sides is what makes the translation reliable.

Speech-to-Text

Native-speaker accuracy across 60+ languages, with support for multilingual speech, alphanumerics, speaker diarization, context.

Translation

Real-time streaming translation across 3,600 language pairs, built for high quality and low delay across all supported languages.

Text-to-speech

High-fidelity speech generation in 60+ languages, built for names, alphanumerics, language switching, and ultra-low-latency streaming.

Together, they create a complete real-time low-latency speech AI platform.

About Persian and English

Persian has roughly 110,000,000 speakers across Iran, Afghanistan, and Tajikistan. English has roughly 1,500,000,000 speakers across United States, United Kingdom, Canada, and Australia.

Persian is one of the world's oldest literary languages, with a written tradition spanning over 2,500 years.

English is the most widely spoken language in the world when including both native and non-native speakers.

Soniox makes Persian to English usable in real-time translation across every supported pair.

Frequently asked questions

How do I translate Persian to English with the API?
Add a translation block to your real-time request with source_language "fa" and target_language "en". Soniox transcribes Persian and streams the English translation over the same WebSocket.
Is Persian to English translation real-time?
Yes. Soniox streams English while Persian is still being spoken, so meaning arrives mid-sentence instead of after the sentence ends.
What about translating English to Persian?
That direction is supported too. See the English to Persian page, or use two-way translation to run both directions in one session.
Does Soniox handle Arabic to Latin output?
Yes. Soniox outputs correctly scripted English text in Latin, including right-to-left rendering, directly in the token stream.
Does Soniox handle Persian dialects and accents?
Yes. Soniox handles Persian dialects like Iranian Persian, Dari (Afghanistan), and Tajik (Tajikistan) in a single model, so English translation stays accurate across regions.
Which other providers support Persian to English?
Based on their public docs, OpenAI, Google, and Azure list both Persian and English for real-time translation. Soniox is the only one that also supports two-way live translation across 60+ languages.
How fast is Persian to English translation?
Soniox streams English as Persian is being spoken, with ultra-low latency. Translation arrives before the sentence ends.

Simple, usage-based pricing

Translate live audio from Persian to English from ~$0.18/hour.

Translation is already built into Soniox Speech-to-Text API. When turned on, it adds about ~$0.06/hour in output token costs.

Ready to get started?

Create an account instantly, or contact us to design a custom package for your business.

Build with API

Documentation

Get up and running in minutes and spend your time building the product, not wrestling with the API.

Explore docs

See what you’ll pay

Pay only for what you use with our flexible pricing. Built to scale with you.

Pricing details