Video to SRT Converter Web Service

This web service transcribes video files (MP4) into SRT subtitles using the OpenAI Whisper API via AiHubMix, with options to translate the subtitles to Chinese or English and to overlay them onto the original video.


Features

Web-based interface for easy file conversion
Converts MP4 files to SRT subtitles with accurate timestamps
Translates subtitles to Chinese or English using configurable AI models
Overlays subtitles onto the original video
Downloads and processes videos from X (Twitter)
Streamable video playback with links for both original and subtitled videos
Configurable API settings with support for custom providers and models
Character count adjustment for wide-screen videos
Progress tracking with one decimal place precision
Responsive web interface

Requirements

Python 3.6+
FFmpeg
ImageMagick (for subtitle overlay)
flask for web interface
moviepy for video/audio processing
openai for API access

Installation

Clone or download this repository
Install dependencies:
```
pip install -r requirements.txt
```

Usage

Running Locally

Configure your API settings as environment variables:

export TRANSCRIBE_API_KEY="your_transcription_api_key"
export TRANSLATE_API_KEY="your_translation_api_key"
# Or use the fallback variable:
# export AIHUBMIX_API_KEY="your_api_key_here"

Run the application:
```
python app.py
```
Open your browser to http://localhost:5000

Web Interface Features

The web interface provides two main ways to process videos:

Upload Local File

Upload MP4 or MP3 files directly from your computer
Choose processing options:
- Translate to Chinese: Convert subtitles to Chinese
- Translate to English: Convert subtitles to English (mutually exclusive with Chinese translation)
- Overlay Subtitles on Video: Add subtitles directly to the video
After processing, download links are provided for:
- Original SRT file
- Translated SRT file (if translation was selected)
- Subtitled video (if overlay was selected)
Video preview links for both original and subtitled versions (opened in new tabs)

Download from X (Twitter)

Enter the URL of an X (Twitter) post with a video
The service will download the video and show processing options
Choose translation (Chinese or English) and/or subtitle overlay options
After processing, download links and video preview links are provided

Video Playback

After successful processing, you can preview both original and subtitled videos
Click "▶️ Play Original Video in New Tab" to play the original video
Click "▶️ Play Video with Subtitles in New Tab" to play the subtitled video
Videos open in new browser tabs for convenient playback

Docker Deployment

Configure your API keys in the docker-compose.yml file or create a .env file:

TRANSCRIBE_API_KEY=your_transcription_api_key
TRANSLATE_API_KEY=your_translation_api_key

Build and run the container:
```
docker-compose up -d
```
Access the service at http://localhost:5000

Ubuntu Server Deployment

Run the deployment script:

chmod +x deploy_ubuntu.sh
sudo ./deploy_ubuntu.sh

Set the API keys:

sudo systemctl set-environment TRANSCRIBE_API_KEY='your_transcription_api_key'
sudo systemctl set-environment TRANSLATE_API_KEY='your_translation_api_key'

Or use the fallback:

sudo systemctl set-environment AIHUBMIX_API_KEY='your_api_key_here'

Access the service at your server's IP address

Configuration

The service supports configurable API settings for both the transcription and the translation service. Each service has its own API key, endpoint URL, and model.

API Configuration

Transcription Settings:

TRANSCRIBE_API_KEY - API key for transcription service (Whisper)
TRANSCRIBE_BASE_URL - API endpoint URL for transcription service (default: https://aihubmix.com/v1)
TRANSCRIBE_MODEL - Model to use for transcription (default: whisper-1)

Translation Settings:

TRANSLATE_API_KEY - API key for translation service (Gemini or other LLM)
TRANSLATE_BASE_URL - API endpoint URL for translation service (default: https://aihubmix.com/v1)
TRANSLATE_MODEL - Model to use for translation (default: gemini-2.5-flash-lite)

Fallback Settings:

AIHUBMIX_API_KEY - Fallback API key (if specific keys are not set)
API_KEY - Final fallback API key
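
The fallback order described above can be sketched in a few lines of Python. This is a minimal illustration, not the code in app.py: the helper name `resolve_api_key` is hypothetical, and treating an empty value as "not set" is an assumption.

```python
import os
from typing import Mapping, Optional

# Fallback variables in the order listed above.
FALLBACK_KEYS = ("AIHUBMIX_API_KEY", "API_KEY")


def resolve_api_key(specific_name: str,
                    env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the first non-empty value among specific_name and the fallbacks.

    Hypothetical helper: the real application may resolve keys differently.
    """
    env = os.environ if env is None else env
    for name in (specific_name, *FALLBACK_KEYS):
        value = env.get(name, "").strip()
        if value:  # assumption: an empty string counts as "not set"
            return value
    return None


# Example: resolve the key for each service from the process environment.
transcribe_key = resolve_api_key("TRANSCRIBE_API_KEY")
translate_key = resolve_api_key("TRANSLATE_API_KEY")
```

Passing a plain dictionary as `env` makes the lookup easy to check without touching real environment variables.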

Configuration Methods

You can provide configuration in several ways:

Environment variables (highest precedence)
In systemd service file
In docker-compose.yml file
In a secure config file at config.env (for local development)
In /etc/video-converter/config.env (system-wide config)
In ~/.video-converter/config.env (user-specific config)

The configuration files should follow the format:

TRANSCRIBE_API_KEY=your_transcription_api_key
TRANSCRIBE_BASE_URL=https://your-transcription-service.com/v1
TRANSCRIBE_MODEL=whisper-1
TRANSLATE_API_KEY=your_translation_api_key
TRANSLATE_BASE_URL=https://your-translation-service.com/v1
TRANSLATE_MODEL=your-model-name
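
As a rough sketch of how a file in this format could be read, the snippet below parses KEY=VALUE lines and lets environment variables override the file values, matching the precedence stated above. The function names are hypothetical, and the handling of `#` comments and surrounding quotes is an assumption rather than documented behavior.

```python
import os
from typing import Dict, Iterable, Mapping, Optional


def parse_config_lines(lines: Iterable[str]) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and '#' comments."""
    settings: Dict[str, str] = {}
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first '=' only, so URLs containing '=' stay intact.
        key, _, value = line.partition("=")
        # Assumption: optional surrounding quotes are not part of the value.
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


def merge_with_environment(file_settings: Mapping[str, str],
                           env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return file settings with non-empty environment variables taking precedence."""
    env = os.environ if env is None else env
    merged = dict(file_settings)
    for key in file_settings:
        if env.get(key):
            merged[key] = env[key]
    return merged
```

A typical call would be `merge_with_environment(parse_config_lines(open("config.env")))`, assuming the file exists in the working directory.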

Security: API Key Protection

To protect your API keys, follow these security practices:

For Ubuntu Deployment

After running the deployment script, create a secure secrets file:
```
sudo nano /etc/video-converter/secrets
```

Add your API keys to the file:

TRANSCRIBE_API_KEY=your_transcription_api_key
TRANSLATE_API_KEY=your_translation_api_key

Set appropriate permissions:

sudo chmod 640 /etc/video-converter/secrets
sudo chown root:video_converter /etc/video-converter/secrets

Restart the service:
```
sudo systemctl restart video-converter
```

For Docker Deployment

Create a .env file in the project directory:

TRANSCRIBE_API_KEY=your_transcription_api_key
TRANSLATE_API_KEY=your_translation_api_key

Make sure the .env file is not committed to version control by adding it to .gitignore:
```
.env
config.env
```

For Local Development

Create a config.env file in the project directory:

TRANSCRIBE_API_KEY=your_transcription_api_key
TRANSLATE_API_KEY=your_translation_api_key

Make sure this file is in your .gitignore to prevent committing it.

API Endpoints

GET /: Main web interface
POST /upload: Upload and process video files
GET /download/<filename>: Download processed files
GET /upload_stream/<filename>: Stream video from upload directory (for original videos)
GET /output_stream/<filename>: Stream video from output directory (for subtitled videos)
GET /subtitles/<filename>: Serve SRT files as WebVTT for video players (if enabled)
GET /download_progress/<url_hash>: Get download progress for X video downloads
POST /download_x_video: Initiate download of video from X (Twitter) URL
POST /process_x_video: Process a downloaded X video with optional translation
POST /telegram_webhook: Telegram bot webhook (for receiving messages from Telegram)
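
The /subtitles endpoint serves SRT content as WebVTT. The two formats are close: WebVTT needs a `WEBVTT` header and uses a period instead of a comma before the milliseconds in cue timings. The sketch below shows one way to do that conversion; `srt_to_webvtt` is a hypothetical name, and the service's own conversion may differ.

```python
import re

# SRT timestamp such as 00:00:01,000 (comma before the milliseconds).
_SRT_TIME = re.compile(r"(\d{2}:\d{2}:\d{2}),(\d{3})")


def srt_to_webvtt(srt_text: str) -> str:
    """Convert SRT subtitle text to WebVTT text."""
    # Normalize line endings and drop a leading byte-order mark if present.
    text = srt_text.replace("\r\n", "\n").lstrip("\ufeff").strip()
    converted = []
    for line in text.split("\n"):
        # Only rewrite timing lines, so commas in the cue text are untouched.
        if "-->" in line:
            line = _SRT_TIME.sub(r"\1.\2", line)
        converted.append(line)
    # Numeric SRT cue numbers are left in place as WebVTT cue identifiers.
    return "WEBVTT\n\n" + "\n".join(converted) + "\n"
```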

Telegram Bot Integration

The service includes a Telegram bot integration that allows users to process videos directly through Telegram.

Setting up the Telegram Bot

Create a bot with BotFather on Telegram
Get your bot token

Set the bot token as an environment variable:

export TELEGRAM_BOT_TOKEN="your_bot_token_here"

Set your public URL where the bot can receive webhooks:
```
export PUBLIC_URL="https://your-domain.com"
```
Set the webhook by calling the set_webhook function in the app
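
For reference, registering a webhook with Telegram comes down to calling the Bot API `setWebhook` method with the public webhook address. The sketch below only builds that request URL from the two environment variables above. `build_set_webhook_url` is a hypothetical helper and is not the app's `set_webhook` function, whose implementation is not shown here.

```python
import os
from urllib.parse import urlencode

WEBHOOK_PATH = "/telegram_webhook"  # webhook endpoint listed under API Endpoints


def build_set_webhook_url(bot_token: str, public_url: str) -> str:
    """Build the Telegram Bot API setWebhook request URL."""
    webhook = public_url.rstrip("/") + WEBHOOK_PATH
    query = urlencode({"url": webhook})
    return f"https://api.telegram.org/bot{bot_token}/setWebhook?{query}"


if __name__ == "__main__":
    # Placeholder defaults keep the example runnable without real credentials.
    token = os.environ.get("TELEGRAM_BOT_TOKEN", "your_bot_token_here")
    public = os.environ.get("PUBLIC_URL", "https://your-domain.com")
    print(build_set_webhook_url(token, public))
```

Requesting the printed URL (for example with a browser or an HTTP client) is what actually registers the webhook; Telegram requires the public URL to use HTTPS.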

Using the Telegram Bot

Once configured, users can:

Send the /start command to get started
Send MP4 or MP3 files to generate subtitles
Receive both the original and the Chinese-translated SRT files
Get help with the /help command

Additional Features

Character Count Adjustment

For wide-screen videos (width > 1280px), the character count per line is doubled (from 16 to 32 characters)
This improves subtitle readability on wide-screen videos by accommodating more text per line
Non-wide-screen videos continue to use the standard 16 characters per line
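
The width rule can be expressed as a small function. The sketch below is illustrative only: the names are hypothetical, and the fixed-length splitting in `wrap_subtitle` is an assumed strategy (reasonable for Chinese text, which has no spaces), not necessarily what the service does.

```python
from typing import List

WIDE_SCREEN_MIN_WIDTH = 1280   # pixels; widths above this count as wide-screen
STANDARD_CHARS_PER_LINE = 16


def chars_per_line(video_width: int) -> int:
    """Return 32 for wide-screen videos (width > 1280px), otherwise 16."""
    if video_width > WIDE_SCREEN_MIN_WIDTH:
        return STANDARD_CHARS_PER_LINE * 2
    return STANDARD_CHARS_PER_LINE


def wrap_subtitle(text: str, video_width: int) -> List[str]:
    """Split text into chunks of at most chars_per_line(video_width) characters."""
    limit = chars_per_line(video_width)
    return [text[i:i + limit] for i in range(0, len(text), limit)]
```

For example, a 40-character subtitle on a 1920px-wide video is split into lines of 32 and 8 characters, while the same text on a 1280px-wide video is split into lines of 16, 16, and 8.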

Progress Tracking

All processing operations show progress with one decimal place precision (e.g., 45.7%)
Progress is tracked both for local file uploads and X video downloads
Progress is reported in fine-grained increments for a smoother user experience
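
A progress value with one decimal place can be produced with ordinary string formatting. The helper below is a hypothetical sketch; clamping the value to the 0 to 100 range and returning 0.0% for an unknown total are assumptions.

```python
def format_progress(done: int, total: int) -> str:
    """Format done/total as a percentage with one decimal place."""
    if total <= 0:
        return "0.0%"  # assumption: unknown total is shown as no progress
    fraction = min(max(done / total, 0.0), 1.0)  # clamp to [0, 1]
    return f"{fraction * 100:.1f}%"
```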

Translation Options

Support for both Chinese and English translations
Translation options are mutually exclusive (select either Chinese or English)
Customizable line break processing for Chinese text based on video width

X Video Download Feature

Download videos directly from X (Twitter) posts using the video URL
Automatic video format detection and download
Integrated processing options after download
Progress tracking during download

Video Streaming

Both original and subtitled videos can be streamed directly in browser
Separate streaming endpoints for upload and output directories
Video links open in new tabs for convenient playback

Command Line Options

Process a specific MP4 file: python mp4_to_mp3.py path/to/video.mp4
Process all files in a directory: python mp4_to_mp3.py path/to/directory
Process an existing MP3 file: python mp4_to_mp3.py path/to/audio.mp3
Translate to Chinese: Add the --translate or -t flag, e.g., python mp4_to_mp3.py --translate path/to/video.mp4
Overlay subtitles on video: Add the --overlay or -o flag, e.g., python mp4_to_mp3.py --overlay path/to/video.mp4
Combine flags: python mp4_to_mp3.py --translate --overlay path/to/video.mp4
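
To make the flag combinations concrete, here is how options like these could be declared with argparse. This is a sketch under assumptions: it mirrors the flags documented above but is not the actual parser in mp4_to_mp3.py, and the positional argument name `path` is hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented command line options."""
    parser = argparse.ArgumentParser(
        prog="mp4_to_mp3.py",
        description="Generate SRT subtitles from video or audio files.",
    )
    # A single file (MP4 or MP3) or a directory containing files to process.
    parser.add_argument("path", help="file or directory to process")
    parser.add_argument("-t", "--translate", action="store_true",
                        help="translate the subtitles to Chinese")
    parser.add_argument("-o", "--overlay", action="store_true",
                        help="overlay the subtitles on the video")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.path, args.translate, args.overlay)
```

Both flags default to off, so running the script with only a path produces the original SRT file without translation or overlay.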

Open Source

This project is open source and welcomes contributions from the community.

Contributing

Fork the repository and create your branch from main
Add your features and ensure they work properly
Update documentation as needed
Submit a pull request with a clear description of your changes

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

If you find this project helpful, consider starring the repository and contributing to its development!
