New: Soniox Text-to-Speech is here

Make accurate, production-ready voice agents in Galego

One API for Galician (Galego) speech-to-text, text-to-speech, and live translation, production-ready at scale, with precise alphanumeric recognition for phone numbers, codes, and IDs.

Galician voice layer around your LLM

Building a voice agent can be tricky. One mis-recognized word (a name, an account number, an accent) and the user feels it, and frustration sets in fast. The agent needs speech-to-text that captures every word and text-to-speech that speaks back without slips, and recovers quickly when a mistake happens. And since users today don't all speak English, the same has to hold across multiple languages to reach a global audience.

Soniox handles the real-time Galician speech pipeline with native-speaker accuracy in Galician and 60+ other languages, with accurate listening and clean, natural speech. The voice layer is solved, you just wire in your preferred LLM to do the thinking.

record_voice_over
STT Listens
Real-time Galician speech-to-text with native-speaker accuracy and multilingual support.
psychology
LLM Thinks
<Your favorite LLM>
Plug in any large language model (GPT, Claude, Gemini, Llama) to power reasoning.
graphic_eq
TTS Speaks
Low-latency Galician text-to-speech with exact rendering of numbers, codes, and names.

If you tried our introduction demo above, you saw the voice loop in action: streaming speech-to-text, LLM reasoning, and streaming text-to-speech working together in real time. We put the same underlying architecture into an open-source reference app you can clone, run locally, and adapt to your own use case and style.

Get your Galician voice agent running

The Soniox Voice Agent demo is an open-source voice-to-voice assistant you can clone, run locally, and adapt to Galician. The default scenario is an appointment-booking agent for a fictional car repair shop, but the same architecture works for any voice agent (support, intake, scheduling, anything that needs to listen and speak).

How the voice loop fits together

Microphone audio runs through Silero VAD for barge-in detection, then into Soniox STT for streaming transcription with semantic endpoint detection. The transcript flows to an LLM, which streams its reply token by token straight into Soniox TTS. Soniox TTS is itself a real-time full-duplex streaming model: text flows in on the WebSocket while audio flows out on the same connection at the same time. It starts streaming back audio from the first few words, and as the LLM keeps producing tokens, Soniox keeps turning them into audio as they arrive. The user hears the reply as it forms, with no wait for the LLM to finish before the voice starts.

What's in the repo

  • Python server: orchestrates VAD, STT, LLM, and TTS, and holds the conversation state.
  • React frontend: captures mic audio in the browser and plays the agent's reply.
  • Twilio proxy (optional): connect the same agent to a phone number.

Get it running

  1. Create an account on Soniox Console and generate your Soniox API key.
  2. Clone Soniox Examples repo that contains apps/soniox-voice-bot-demo code and follow the READMEs in the /server and /frontend folders to install dependencies and set your API key.
  3. Adapt the system prompt, STT language hints and TTS language to Galician. See STT language hints and TTS supported languages.
  4. Start the server and the frontend, open it in your browser, and have a Galician conversation.

Learn more about the demo and all possible STT and TTS API configurations and concepts from our comprehensive docs page.

Plug Soniox into your framework of choice

If you don't want to wire up the voice loop yourself, and you're already using a popular voice-agent framework, Soniox plugs in as the STT and TTS through ready-made integrations.

  • Pipecat (voice-agent framework): drop in Soniox STT and TTS through the official STT and TTS packages.
  • LiveKit (real-time audio platform): use Soniox as the speech layer for LiveKit voice agents in the browser, on mobile, or over telephony.

Soniox also provides official integrations with LangChain, Twilio, n8n and more.

The new standard for Galician voice AI

Soniox unifies Galician speech-to-text, text-to-speech, and translation in one platform, delivering lower latency, simpler architecture, and native-speaker Galician accuracy through a single API.

hub

One speech API for the full Galician voice stack

Use Galician speech-to-text, text-to-speech, and translation through a single API and provider. Reduce integration complexity, simplify system design, and ship Galician voice products faster.

bolt

Lower latency across every Galician turn

Run Galician transcription, translation, and speech generation on one real-time platform built for live interaction. Deliver faster turn-taking and more natural Galician conversations.

record_voice_over

Galician voice agents with native-speaker accuracy

Build voice agents that recognize and generate Galician speech with native-speaker accuracy, including code-switching across 60+ languages.

pin

Precise alphanumerics in Galician

Capture and speak email addresses, phone numbers, addresses, IDs, and codes in Galician with the precision production voice agents require.

The complete Galician speech stack for voice agents

One API provides the building blocks of Galician voice agents: recognize Galician speech, generate Galician speech, translate live across 60+ languages, and stream in real time with low latency.

language

Native-speaker Galician speech recognition

Recognize Galician speech across accents, names, numbers, and domain-specific vocabulary with unmatched accuracy, even in noisy, multi-speaker conversations.

equalizer

Galician text-to-speech built for precision

Generate natural, high-fidelity Galician speech built for alphanumerics, names, borrowed words, language switching, and other hard production TTS cases.

translate

Translation for multilingual Galician conversations

Translate spoken Galician content in real time across 60+ languages and 3,600+ language pairs, including conversations where speakers switch languages mid-sentence.

speed

Low-latency streaming for live Galician interaction

Transcribe, translate, and generate Galician speech in real time with low-latency streaming built for voice agents, live conversations, and interactive products.

One global API, deployed locally

Use the same models and API everywhere, with in-region processing to meet latency, data residency, and regulatory requirements.

Soniox Data Residencyarrow_right_alt

Privacy and compliance, built right in

Never stored, never saved.

Audio stays in memory, everything is processed in real-time.

Built for privacy-critical use cases.

Adhering to leading global security, privacy, and compliance standards.

Trusted where privacy matters most.

Used in industries where speech is sensitive, from healthcare to enterprise.

Soniox is Soc 2 Type 2 compliant
Soniox is ISO 27001:2022 compliant
Soniox is HIPAA compliant
Soniox is GDPR compliant
SOC 2 Type 2 · ISO/IEC 27001:2022 · HIPAA · GDPR
Trusted by startups and enterprises

Powering the world's most demanding products

From global enterprises to frontier AI labs, teams choose Soniox for the accuracy, speed, and scale their products demand.

Perplexity integrated Soniox to power a best-in-class voice experience for millions of Perplexity users.

A global technology leader using Soniox across internal meetings, call centers, and government projects in Korea.

Using Soniox for real-time captions and voice interactions, helping bring faster and more natural speech experiences to users.

Using Soniox to power transcription and real-time speech translation across meetings and contact center products.

An enterprise AI agent platform, using Soniox to power voice AI agents across non-English markets where best-in-class voice AI is scarce.

Pioneers in AI-powered healthcare technology, dedicated to transforming the way healthcare providers deliver care.

Using Soniox for best-in-class real-time captioning in its widely used meeting notes platform.

Trusted by millions of people worldwide, using Soniox to power highly accurate transcription for phone calls and voice messages across multiple languages.

It just gets the words right — any language, any accent, any context. That’s what accuracy is supposed to look like.

Tony Wang

Cofounder & Chief Revenue Officer at Agora

We tried a dozen speech-to-text and translation services. Soniox is the best, so that's what we use.

Cayden Pierce

CEO/CTO at Mentra

A fast-growing real-time translation app, using Soniox to power low-latency speech translation for seamless multilingual communication.

As the leading provider of voicebots for automotive dealerships in Germany, we’ve faced significant challenges recognizing license plates accurately. Soniox has solved this problem with exceptional recognition of alphanumeric sequences, resulting in a much higher acceptance rate for our voicebot.

Dr. Steven Zielke

Founder & CEO of mobilApp

It’s so fast, captions appear before people even finish talking. Zero lag. No buffering. Nothing.

Dag-Inge Aas

Head of AI at Tana

Frequently asked questions

Does Soniox support real-time Galician voice agents?arrow_downward
Yes. Soniox provides real-time Galician speech-to-text and streaming full-duplex text-to-speech designed for low-latency voice agents, with sub-200ms STT latency and TTS that starts speaking before the full sentence is available.
Can I build Galician voice agents with one API?arrow_downward
Yes. Soniox unifies speech-to-text, text-to-speech, and real-time translation in a single API. You can build, deploy, and scale Galician voice agents without managing separate providers or stitching pipelines together.
How accurate is Soniox for Galician voice agents?arrow_downward
Soniox delivers native-speaker accuracy in Galician (Galego), built from the ground up across 60+ languages rather than optimized for English first. Both recognition and speech generation handle accents, names, and domain-specific vocabulary.
Can Soniox handle mid-sentence language switching with Galician?arrow_downward
Yes. Both STT and TTS handle language switching mid-sentence, accurately recognizing and generating mixed-language speech when Galician is combined with English or other languages, with no manual configuration required.
How does Soniox handle phone numbers, codes, and IDs in Galician?arrow_downward
Soniox captures alphanumerics (phone numbers, email addresses, reference IDs, and codes) exactly as spoken in Galician, and TTS pronounces them precisely, character by character. Critical for production voice agents that take orders, verify identity, or read confirmations.
Does Soniox support real-time translation between Galician and other languages?arrow_downward
Yes. Soniox supports real-time, mid-sentence translation between Galician and any of 60+ supported languages, covering 3,600+ language pairs. Translation streams continuously as people speak, not after they finish.
Is the Galician voice platform production-ready?arrow_downward
Yes. Soniox runs on production-hardened infrastructure with 99.9% uptime, priority support, and in-region deployment for data residency. Currently available in the US, EU, and Japan, with more regions coming soon. Soniox is SOC 2 Type 2 compliant, ISO 27001 certified, and supports HIPAA and GDPR compliance.
How do I get started with Galician voice agents?arrow_downward
Sign up for an API key, then follow the documentation to wire up Galician STT, TTS, and translation through a unified API.
Build with API · Explore docs

Get started with the Soniox API

Create an account instantly, or contact us to design a custom package for your business.

Build with API arrow_right_alt

Documentation

Get up and running in minutes and spend your time building the product, not wrestling with the API.

Explore docs

See what you’ll pay

Pay only for what you use with our flexible pricing. Built to scale with you.

Pricing details