GitHub - mohiitaa/phone_number_recognition

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
data		data
misc		misc
pretrained_model		pretrained_model
speech_rec_pytorch		speech_rec_pytorch
static		static
templates		templates
README.md		README.md
application.py		application.py
contact.png		contact.png
demo.py		demo.py
environment.yml		environment.yml
model.py		model.py
utils.py		utils.py

Repository files navigation

Create the required environment

conda env create -f environment.yml
conda activate ufo

Directory Structure

The directory contains the following folders:

data/: contains some audio samples at audio/. The speech chunks get saved at audio/ and MFCC images get saved at images/
speech_rec_pytorch/: contains files to train a digit recognition model from scratch
pretrained_model/: contains pretrained model. Replace with the model you want to use. I am using speech_net_aug.pth.tar.
static/ and templates/: For the Web App part
misc/: Some utility functions

Files

To run the Web App:

export FLASK_APP=application.py
flask run.

Then navigate to http://127.0.0.1:5000/.

To run a simple demo

python demo.py -pn <file_name from audio/>

Demo

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 83.6%
HTML 16.4%