- `conda env create -f environment.yml`
- `conda activate ufo`

The directory contains the following folders:
- `data/`: contains some audio samples in `audio/`. The speech chunks get saved to `audio/` and the MFCC images get saved to `images/`.
- `speech_rec_pytorch/`: contains files to train a digit recognition model from scratch.
- `pretrained_model/`: contains the pretrained model. Replace it with the model you want to use. I am using `speech_net_aug.pth.tar`.
- `static/` and `templates/`: files for the Web App part.
- `misc/`: some utility functions.

To run the Web App:
- `export FLASK_APP=application.py`
- `flask run`

Then navigate to http://127.0.0.1:5000/.

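If the page does not load, it can help to check whether anything is answering on that address at all. The following is a minimal sketch using only the Python standard library; `app_is_up` is a hypothetical helper and not part of this repository, and it only tests that some HTTP server responds, not that the Web App itself is working correctly.

```python
import urllib.error
import urllib.request


def app_is_up(url="http://127.0.0.1:5000/", timeout=2.0):
    """Return True if an HTTP server answers at `url` (hypothetical helper)."""
    # Bypass any system proxy settings, since the target is a local address.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just with an error status code.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or similar: nothing is listening.
        return False
```

Calling `app_is_up()` while `flask run` is active in another terminal should return `True`; a `False` result suggests the server has not started or is listening on a different port.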
To run a simple demo:

`python demo.py -pn <file_name from audio/>`
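
To try the demo on several samples in one go, the command above can be wrapped in a short script. This is only a sketch under stated assumptions: it assumes the samples live in `data/audio/`, that `-pn` expects the bare file name, and that it is run from the repository root. `demo_commands` and `run_all` are hypothetical names, not part of this repository.

```python
import subprocess
import sys
from pathlib import Path


def demo_commands(audio_dir, script="demo.py"):
    """Build one `demo.py -pn <file_name>` command per file in `audio_dir`."""
    # Only regular files are used; subdirectories are skipped.
    files = sorted(p for p in Path(audio_dir).iterdir() if p.is_file())
    return [[sys.executable, script, "-pn", p.name] for p in files]


def run_all(audio_dir="data/audio"):
    """Run the demo once per audio file (the directory path is an assumption)."""
    for cmd in demo_commands(audio_dir):
        print("Running:", " ".join(cmd))
        # check=True stops at the first failing run instead of continuing silently.
        subprocess.run(cmd, check=True)
```

Calling `run_all()` with the `ufo` environment active runs the demo on each sample in alphabetical order.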