Transcriber

Transcriber is a simple Python script that converts audio files (MP3, WAV, WEBM, or MP4) to text using the OpenAI Whisper model. It supports multiple languages and output formats.

Features

Transcribe audio files in MP3, WAV, WEBM, or MP4 format.
Supports multiple Whisper models: tiny, base, small, medium, large.
Specify the language of the audio for accurate transcription.
Output the transcription in different formats: TXT, SRT, JSON.
Verbose mode for detailed transcription process.

Requirements

Python 3.7 or higher
pydub
ffmpeg
whisper

Installation

Clone the repository:

git clone https://github.com/erseco/transcriber.git
cd transcriber

Create and activate a virtual environment (optional but recommended):

python3 -m venv env
source env/bin/activate  # On Windows use `env\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```
Make sure you have ffmpeg installed. You can download it from here and follow the installation instructions for your operating system.

Usage

python transcriber.py <input_file> [--language <language>] [--model <model>] [--output_format <format>] [--verbose]

Arguments

<input_file>: Path to the MP3, WAV, WEBM, or MP4 file to transcribe.
--language: Language of the audio for transcription (default: es for Spanish).
--model: Whisper model to use for transcription (default: medium). Options: tiny, base, small, medium, large.
--output_format: Output format for the transcription (default: txt). Options: txt, srt, json.
--verbose: Enable verbose output during transcription.

Examples

Transcribe an MP3 file to a text file:

python transcriber.py ~/Downloads/audio.mp3 --language en --model small --output_format txt

Transcribe a WEBM file to a JSON file with verbose output:

python transcriber.py ~/Downloads/audio.webm --language fr --model large --output_format json --verbose

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
transcriber.py		transcriber.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transcriber

Features

Requirements

Installation

Usage

Arguments

Examples

Contributing

License

About

Releases

Packages

Languages

erseco/transcriber

Folders and files

Latest commit

History

Repository files navigation

Transcriber

Features

Requirements

Installation

Usage

Arguments

Examples

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages