Soniox

Pipecat

Integrate Soniox Speech-to-Text and Text-to-Speech into Pipecat pipelines.

Soniox x Pipecat

Overview

Pipecat is a framework for building voice-enabled, real-time, multimodal AI applications. A typical Pipecat pipeline for voice applications looks like this:

  1. Send Audio - Transmit and capture streamed audio from the user.
  2. Transcribe Speech - Convert speech to text as the user is talking.
  3. Process with LLM - Generate responses using a large language model.
  4. Convert to Speech - Transform text responses into natural speech.
  5. Play Audio - Stream the audio response back to the user.

Soniox plugs into two stages of this pipeline:

For more details on how Pipecat works, check the Pipecat documentation.

Installation

Install the Soniox extras for Pipecat:

pip install "pipecat-ai[soniox]"

You will also need to set up your Soniox API key as an environment variable:

export SONIOX_API_KEY=<your_soniox_api_key>

You can obtain a Soniox API key by signing up at the Soniox Console.

Services