Soniox API
Turn clinical conversations into structured medical transcription
Build apps that capture, transcribe, and translate medical speech – from bedside conversations to clinical dictation – in real-time. Support 60+ languages, custom medical terms, and HIPAA-sensitive use cases, all with one secure API.
Helping startups and enterprises ship real world voice apps
For healthcare apps that help doctors listen, speak, and care
Capture doctor-patient conversations automatically
Securely record free-flowing clinical conversations without prompts or setup. Get structured transcripts in real-time, for summaries, charting, or EHRs.
Dictation that lets providers speak freely and move fast
No need to pause or slow down. Transcribe spontaneous, unstructured speech with accuracy and speed, outputting clean, organized notes.
Real-time translation for more inclusive care
Help providers and patients communicate clearly across language barriers. Transcribe and translate medical speech instantly.
Smarter documentation that integrates seamlessly
Use structured transcripts to automate notes, billing, and more. With speaker labels, timestamps, and formatting that fit directly into clinical workflows.
Deliver better care with less documentation overhead
Stay engaged while Soniox takes the notes.
No typing or distractions. Soniox captures and organizes doctor-patient conversations in real-time, so providers can focus on listening, understanding, and delivering better care.
Structure fast, free-flowing medical speech.
From rapid dictation to natural back-and-forth, Soniox keeps up with real-world medical terminology and outputs clean, structured transcripts. No cleanup required.
Connect with patients in their own language.
Transcribe and translate patient speech in real-time so providers and patients can speak directly. No delays, confusion, or need for human interpreters.
Built for HIPAA-sensitive apps and workflows.
Audio is streamed and processed entirely in memory. Nothing is stored. With SOC 2 Type 2 and ISO/IEC 27001:2022-certified infrastructure, Soniox is secure for any app handling regulated medical speech and data.
One secure API for every clinical voice workflow.
Simplify your voice stack across fragmented healthcare systems. Soniox delivers transcription, translation, formatting, and speaker logic through a single API that plugs into your existing stack.
Try it live. Start talking.
Put Soniox to the test. See how our medical transcription API stacks up against others »
Speech infrastructure for massive scale

Build on one API and deploy in your region
Use the same models and API everywhere, with in-region processing to meet latency, data residency, and regulatory requirements.
Available: US, EU, Japan
Coming soon: Korea, Australia, Canada, India, Saudi Arabia, UK, Brazil

Run mission-critical systems with confidence
- 99.9% uptime
Production-hardened infrastructure with monitoring and redundancy. - Ultra-low-latency streaming
Process speech in real time with low latency for responsive voice applications. - Priority support
Severity-based incident response with direct access to the Soniox team.
"Before Soniox, our international users always had a noticeably different experience. Now accuracy and responsiveness match across all regions…it feels like one system instead of five."
Alon Yair CTO of Onvego
Privacy and compliance, built right in
Never stored, never saved.
Audio stays in memory, everything is processed in real-time.
Built for privacy-critical use cases.
Adhering to leading global security, privacy, and compliance standards.
Trusted where privacy matters most.
Used in industries where speech is sensitive, from healthcare to enterprise.




Ready to get started?
Create an account instantly, or contact us to design a custom package for your business.
Build with API arrow_right_altDocumentation
Get up and running in minutes and spend your time building the product, not wrestling with the API.
Explore docsSee what you’ll pay
Pay only for what you use with our flexible pricing. Built to scale with you.
Pricing details