Subtitle Generator

A Python project for generating subtitles from audio files using Voice Activity Detection (VAD) and Speech-to-Text (STT) models.

The subtitle language matches the language of the selected Vosk model. Subtitles can be generated for any language supported by both Vosk and Silero-VAD models.

You can see results of the project on example folder.

Features Checklist

~~Voice Activity Detection (VAD) using Silero VAD~~
~~Speech-to-Text (STT) using Vosk (English )~~
~~Support any vosk model.~~
~~Subtitle file generation (SRT format)~~
Translation support for other languages
Text-to-Speech (TTS) integration
~~CLI interface~~
GUI interface
Docker support

Current Limitations

The subtitle language is determined by the Vosk model you use.
Subtitles can be generated for any language supported by both Vosk and Silero-VAD.

Installation

Clone the repository:

git clone https://github.com/DataSciense-py/subtitel-generator.git
cd subtitel-generator

Install dependencies using Poetry:
```
poetry install
```
Download the required Vosk model and place its folder inside the models/vosk/ directory. For example, for English, use models/vosk/vosk-en (where vosk-en is any Vosk model folder you want to use). No additional setup is required: the program will automatically use the model from the specified folder.

Usage Example

See src/main.py for a runnable example. Basic usage:

from subtitel_generator import SubtitelGenerator
from subtitel_generator.file_generator import SrtSubtitleFileGenerator
from subtitel_generator.speech_to_text import VoskSTT
from subtitel_generator.voive_activation_detector import VADSilero

s = SubtitelGenerator(
    vad=VADSilero(),
    stt=VoskSTT(),
    file_generater=SrtSubtitleFileGenerator(),
)

s.generate(audio_file_path="example/Example_audio_endlish_small.wav") # Path to the audio file

Or you can use the CLI interface:

cd src

python cli.py

And write full file path (audio or video)

Project Structure

src/ — Source code
- subtitel_generator/ — Main package
  - __init__.py — Package initialization
  - subtitel_model - Subtitel model
  - subtitel_generator.py — Main orchestration class
  - file_generator/ — Subtitle file generators (e.g., SRT)
  - speech_to_text/ — Speech-to-text models (Vosk)
  - voive_activation_detector/ — Voice activity detection (Silero)
  - utils/ — utilities
    - logging — Logging utilities
- cli.py — Command-line interface
- main.py — Main script
models/ — Pretrained models (e.g., Vosk)
example/ — Example audio files
tests/ — Unit tests

Testing

Run tests with:

poetry run pytest

License

This project is licensed under the terms of the APACHE2.0 License.

Contributing

Contributions are welcome! Please open issues or submit pull requests.

Thanks

Test video Автор: shovonrdm

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
example		example
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Subtitle Generator

Features Checklist

Current Limitations

Installation

Usage Example

Project Structure

Testing

License

Contributing

Thanks

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Subtitle Generator

Features Checklist

Current Limitations

Installation

Usage Example

Project Structure

Testing

License

Contributing

Thanks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages