Just launched: Real-time speech translation AI for the world

June 17, 2025 by Soniox Team

Today, we’re launching real-time multilingual translation in the Soniox API, as well as a brand new mobile app that brings the same technology to your pocket. Together, these launches make Soniox the most powerful and flexible way to build and use voice globally.

With instant translation now streaming in over 60 languages, Soniox unlocks an entirely new class of applications: multilingual meetings, global voice interfaces, AI agents that can understand and respond across languages without delay, and much, much more.

Voice is becoming the next interface.

As global stakes rise, the pressure to build fast, global-ready products is only growing. Real-time voice lets products speak to everyone, everywhere.

Text was the first interface for software. Voice is the next. But to make that shift real, developers need infrastructure they can trust and build on easily.

Until now, building with voice meant tradeoffs: speed vs. accuracy, transcription vs. translation, fast builds vs. global reach. Soniox removes those compromises.

Our AI processes human speech as it happens, across languages, accents, interruptions, and noise. It doesn’t just transcribe. It understands, organizes, and responds. That’s why it works not just for phone calls, but for meetings, agents, accessibility tools, wearables, and more.

New tools for real-time voice, everywhere.

For developers

Build fast, fluid voice experiences with a single API.

Get API key

For everyone

See it in action with the mobile app.

Get the App

Multilingual translation in the Soniox API

Our updated API now supports:

  • Real-time translation in 60+ languages, streaming as users speak.
  • Speaker-aware output to track conversations clearly.
  • Automatic spoken language identification for seamless mid-sentence translation across languages.
  • Structured JSON responses for real-time integration into your stack. No batching, no model switching, just stream and go.
  • Unified transcription + translation in one API stream.

With it, developers can build:

  • Multilingual meeting tools that transcribe, translate, and summarize
  • Voice-driven agents that speak and understand across languages
  • Real-time translation overlays for video, calls, or events
  • An entire new class of wearables and keyboard-free experiences
  • Accessibility features that work for users everywhere

The new Soniox mobile app

Now available on iOS and Android, the app brings Soniox to anyone, anywhere. Whether you're recording a meeting, capturing a doctor's visit, or traveling this summer, the app makes real-time speech AI accessible in everyday moments.

I

With the app, you can:

  • Speak and see transcription and translation happen live, word by word
  • Capture voice notes, meetings, or conversations and get instant, structured output
  • Export results with speaker labels and language metadata for easy sharing or search

It’s the fastest way to experience Soniox and a powerful tool for day-to-day use.

Unlocking global voice experiences.

With this launch, developers can build products that hear the entire world. Businesses can expand reach instantly. Users can talk, be understood, and act in real time, no matter the language.

We’re making voice a first-class part of the modern stack. And in doing so, we’re helping voice reach its full potential: not just as input, but as a full interface for understanding, interaction, and action.

"We want developers to build with voice the way they build with text: programmatically, globally, and in real time. Voice isn’t a novelty. It’s infrastructure. And if you’re building something that listens, speaks, or understands, Soniox should be your foundation."
— Klemen Simonic, CEO of Soniox

Ready to see for yourself? Start here.

We built Soniox to be open, transparent, and ready for real-world use, and we’d love for you to try it. Shoot us a note at hello@soniox.com to let us know what you think!