Vision-based sign language recognition aims to identify human hand gestures and interpret the information they convey. For this project, we present and analyze a system that interprets American Sign Language (ASL) using a vocabulary drawn from the Word-Level American Sign Language (WLASL) video dataset, which contains more than 2,000 words and 21,000 videos performed by over 100 signers.
Due to limited computational resources, we restrict the task to a binary classifier trained on at most 12 videos per word. We also wrote scripts to parse and preprocess the WLASL dataset for this classification task.
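As a point of reference, the sketch below shows how such a subset can be selected from the WLASL metadata. It assumes the WLASL_v0.3.json layout shipped with the dataset (a list of entries, each with a gloss string and an instances list); the NUM_WORDS and MAX_VIDEOS_PER_WORD constants are illustrative names, not necessarily those used in our scripts.

```python
# Minimal sketch of selecting a small vocabulary subset from the WLASL metadata.
# Assumes the WLASL_v0.3.json layout (a list of entries, each with a "gloss"
# string and an "instances" list of video records). NUM_WORDS and
# MAX_VIDEOS_PER_WORD are illustrative names, not the exact ones in our scripts.
import json

NUM_WORDS = 2             # binary classification: two words
MAX_VIDEOS_PER_WORD = 12  # cap imposed by our limited compute

def select_subset(metadata_path="WLASL_v0.3.json"):
    with open(metadata_path) as f:
        entries = json.load(f)

    subset = {}
    for entry in entries[:NUM_WORDS]:
        # keep at most MAX_VIDEOS_PER_WORD video instances per word
        subset[entry["gloss"]] = entry["instances"][:MAX_VIDEOS_PER_WORD]
    return subset

if __name__ == "__main__":
    for gloss, instances in select_subset().items():
        print(gloss, len(instances))
```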
Running Instructions:
1. Run python video_downloader.py to download the raw videos. The downloaded raw videos can be found under the raw_videos directory.
2. Run python video_samples.py to process the raw videos into video samples. The video samples can be found under the videos directory.
3. Run python preprocess.py to split the video samples into frames, create the training and testing datasets, and store them in directories. The frames can be found in the frames directory, separated into training and testing sub-directories.
4. Run python model.py to train and evaluate the model. The training and testing accuracy for each epoch will be printed out.
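For illustration, the sketch below shows what the frame-splitting step can look like with OpenCV. The videos and frames directory names match the ones above, but the 80/20 training/testing split and the per-frame JPEG output are assumptions, not necessarily the exact logic in preprocess.py.

```python
# Illustrative sketch of the frame-splitting step (not the exact preprocess.py logic).
# Reads each clip under videos/ and writes its frames as JPEGs into
# frames/training or frames/testing. The 80/20 split is an assumed ratio.
import os
import random
import cv2

VIDEOS_DIR = "videos"
FRAMES_DIR = "frames"
TRAIN_FRACTION = 0.8  # assumed training/testing split

def split_video_into_frames(video_path, out_dir):
    """Write every frame of one video as a JPEG into out_dir."""
    os.makedirs(out_dir, exist_ok=True)
    name = os.path.splitext(os.path.basename(video_path))[0]
    cap = cv2.VideoCapture(video_path)
    index = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        cv2.imwrite(os.path.join(out_dir, f"{name}_{index:04d}.jpg"), frame)
        index += 1
    cap.release()

def build_frame_dataset():
    clips = sorted(f for f in os.listdir(VIDEOS_DIR) if f.endswith(".mp4"))
    random.shuffle(clips)
    cutoff = int(TRAIN_FRACTION * len(clips))
    for i, clip in enumerate(clips):
        split = "training" if i < cutoff else "testing"
        split_video_into_frames(os.path.join(VIDEOS_DIR, clip),
                                os.path.join(FRAMES_DIR, split))

if __name__ == "__main__":
    build_frame_dataset()
```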
Problems with Data:
- Some video samples still contain extraneous, unrelated frames even after preprocessing, which hurts the model's accuracy.
- Many video samples include frames in which the signer is standing still and not signing. These frames are hard for the model to classify and also hurt accuracy.
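One possible way to flag such near-static frames, not part of the current scripts, is a simple frame-differencing check; the threshold below is an illustrative value that would need tuning.

```python
# Illustrative heuristic (not part of the current pipeline): score each frame by
# how much it differs from the previous one; very low scores usually mean the
# signer is standing still. The threshold is an assumption to tune per dataset.
import cv2
import numpy as np

MOTION_THRESHOLD = 2.0  # illustrative value; tune on a few sample clips

def low_motion_frame_indices(video_path, threshold=MOTION_THRESHOLD):
    cap = cv2.VideoCapture(video_path)
    prev_gray, flagged, index = None, [], 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        if prev_gray is not None:
            score = float(np.mean(cv2.absdiff(gray, prev_gray)))
            if score < threshold:
                flagged.append(index)  # likely a "standing still" frame
        prev_gray = gray
        index += 1
    cap.release()
    return flagged
```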
The number of words to classify and the number of videos to download for each word can be adjusted by changing the parameters in video_downloader.py and video_samples.py.