A real-time conversational AI bot powered by Pipecat and deployed on Modal. This project features an interactive RAG (Retrieval-Augmented Generation) system with real-time speech-to-speech interaction.
```bash
git clone git@github.com:modal-projects/open-source-av-ragbot.git
cd open-source-av-ragbot
```

This project uses uv for Python package management:
```bash
# Install dependencies
uv sync

# Activate the virtual environment
source .venv/bin/activate
```

Go to modal.com and make an account if you don't have one.
```bash
# Authenticate your Modal installation
modal setup
```

Build the frontend client:

```bash
cd client
npm i
npm run build

# Return to the root dir
cd ..
```

The project consists of multiple Modal services that need to be deployed:
```bash
# From the root dir of the project

# Deploy an LLM service:
# either the vLLM inference server for optimized TTFT
modal deploy -m server.llm.vllm_server
# or the SGLang server for
# faster cold starts with GPU snapshots
modal deploy -m server.llm.sglang_server

# Deploy the Parakeet STT service
modal deploy -m server.stt.parakeet_stt

# Deploy the Kokoro TTS service
modal deploy -m server.tts.kokoro_tts

# Deploy the main bot application with its frontend
modal deploy -m app
```

We can speed up the cold start time of our bot (which matters most) and of our Parakeet and LLM services (if using SGLang) by using snapshots. The trade-off is extra startup time for the first few containers after an app is (re-)deployed. To warm up the snapshots, run these files as Python scripts:
```bash
python -m server.stt.parakeet_stt
python -m server.llm.sglang_server
python -m app
```
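For context on how a service opts into snapshotting, here is a minimal sketch of Modal's memory snapshot pattern. The class and method names below are illustrative, not taken from this repo, and the sketch assumes Modal's `enable_memory_snapshot` / `@modal.enter(snap=True)` API:

```python
import modal

app = modal.App("snapshot-sketch")

# enable_memory_snapshot=True asks Modal to snapshot the container's memory
# after the @modal.enter(snap=True) steps complete, so later cold starts can
# restore from the snapshot instead of re-running the expensive setup.
@app.cls(enable_memory_snapshot=True)
class SketchService:
    @modal.enter(snap=True)
    def load_model(self):
        # Hypothetical expensive setup captured in the snapshot,
        # e.g. loading model weights into memory.
        self.model = "loaded"

    @modal.method()
    def infer(self, prompt: str) -> str:
        return f"{self.model}: {prompt}"
```

Running a service's module as a script (as in the warmup commands above) triggers at least one container start, which gives Modal a chance to build and cache the snapshot before real traffic arrives.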