GitHub - paratustra/audio-transcription-bot: Audio Transcription WhatsApp Bot using Whisper

Audio Transcription WhatsApp Bot

Flask service that receives WhatsApp audio messages via Twilio and replies with a transcription using OpenAI Whisper.

Why

WhatsApp voice notes are slow to skim. This bot turns them into text.

Features

Loads Whisper model once at startup
Validates Twilio signatures
Supports WhatsApp voice notes (OGG/Opus) and general audio/*
Sync or async replies via env flag
Health endpoint at /healthz

Requirements

Python 3.10+
ffmpeg installed on system path
Twilio WhatsApp Sandbox or Business API

Install

pip install -r requirements.txt

Install ffmpeg if you don't have it:

brew install ffmpeg

Configure

Create a .env file:

ACCOUNT_SID=ACxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
AUTH_TOKEN=your_auth_token
FROM=whatsapp:+1415xxxxxxx
MODEL_NAME=small
PORT=5000
DEBUG=false
ASYNC_REPLY=false

Alternatively, copy and edit env.sample to .env.

Notes:

FROM can be configured with or without the whatsapp: prefix; the app handles both.
MODEL_NAME options: tiny, base, small, medium, large (trade speed vs accuracy).
ASYNC_REPLY=true returns immediately and sends the transcription in a second message.

Run

python main.py

Expose locally for Twilio callbacks:

ngrok http 5000

Set your Twilio WhatsApp sandbox Inbound Webhook URL to:

POST https://<your-ngrok-domain>/whatsapp

Usage

Send a voice note or audio file to your Twilio WhatsApp number
You will receive the transcription back

Notes on Cost

Each audio incurs costs for messaging and compute. Whisper model size affects speed and cost; smaller models are faster and cheaper to run.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
env.example		env.example
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Transcription WhatsApp Bot

Why

Features

Requirements

Install

Configure

Run

Usage

Notes on Cost

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

paratustra/audio-transcription-bot

Folders and files

Latest commit

History

Repository files navigation

Audio Transcription WhatsApp Bot

Why

Features

Requirements

Install

Configure

Run

Usage

Notes on Cost

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages