Automatic Video Subtitling System for Arabic/English Using Speech Recognition
The goal of this project is to develop an application that automatically subtitles English and Arabic videos. At the end of execution, the application provides the user with a complete subtitle file. Subtitling is based on the CMU Sphinx speech recognition engine, which is open-source software.
In this project, the automatic speech recognition (ASR) engine is used mostly as a black box; the student's role is to understand how to integrate the engine into the application.
- Extract the audio from the video (FFmpeg)
- Convert the audio to 16 kHz, one channel (FFmpeg)
- Find where each speech pause occurs in the audio (LIUM)
- Segment the audio at each pause (FFmpeg)
- Run speech recognition on each segment (CMU Sphinx)
- Save the result to an SRT file
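
The last step, saving to an SRT file, can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes each recognized segment is available as a (start seconds, end seconds, text) tuple, and the function names format_timestamp, build_srt and write_srt are hypothetical. It follows the common SubRip layout of a running index, a "HH:MM:SS,mmm --> HH:MM:SS,mmm" time line, the text, and a blank line between cues.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(cues) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples.

    The tuple layout is an assumption made for this sketch.
    """
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        time_line = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_line}\n{text}\n")
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


def write_srt(cues, path: str) -> None:
    """Write the cues to a file, using UTF-8 so Arabic text is preserved."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(build_srt(cues))
```

For example, write_srt([(0.0, 1.25, "hello")], "output.srt") would produce a file with a single cue; the file name here is only a placeholder.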
- LIUM: detects segments
- FFmpeg: converts and segments audio
- Sphinx: speech recognition engine
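
As a rough sketch of the FFmpeg part of the tool chain, the audio extraction, the conversion to 16 kHz mono, and the cutting of one segment could be driven from Python as shown here. The helper names and the choice of 16-bit PCM WAV output are assumptions made for illustration; the segment start and duration are expected to come from the pause detection step. The ffmpeg options used (-i, -vn, -ac, -ar, -acodec, -ss, -t, -y) are standard command-line options.

```python
import subprocess


def extract_audio_cmd(video_path: str, wav_path: str) -> list:
    """Build an ffmpeg command that extracts 16 kHz mono audio from a video."""
    return [
        "ffmpeg", "-y",          # overwrite the output file if it exists
        "-i", video_path,
        "-vn",                   # drop the video stream
        "-ac", "1",              # one audio channel (mono)
        "-ar", "16000",          # 16 kHz sample rate
        "-acodec", "pcm_s16le",  # 16-bit PCM (an assumption for this sketch)
        wav_path,
    ]


def cut_segment_cmd(wav_path: str, start: float, duration: float,
                    out_path: str) -> list:
    """Build an ffmpeg command that cuts one segment out of the audio."""
    return [
        "ffmpeg", "-y",
        "-i", wav_path,
        "-ss", f"{start:.3f}",   # segment start, in seconds
        "-t", f"{duration:.3f}", # segment length, in seconds
        "-acodec", "pcm_s16le",
        out_path,
    ]


def run(cmd: list) -> None:
    """Run a command and raise CalledProcessError if ffmpeg fails."""
    subprocess.run(cmd, check=True)
```

A call such as run(extract_audio_cmd("talk.mp4", "talk.wav")) would then perform the first two steps in one ffmpeg invocation; the file names are placeholders.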
- The decoder source code
- The language dictionary
- The acoustic model
- The language model
- The test data
If you want to contact me, you can reach me at [email protected].
This project uses the following license: GNU.