Quick and dirty script suite for curating khanacademy video transcripts
Usage
MODULE="Early math"
URL="https://www.khanacademy.org/math/early-math"
python download_video_ids.py "${MODULE}" "${URL}"
python download_transcripts.py
python parse_transcripts.py
Requirements
requests
TODO
- Refactor
parse_transcripts.py
- Abstract file containing video ids
- Save video ids using directory hierarchy
- Auto detect subject(e.g. Math => Linear Algebra, Calculus ...)
- Only download necessary tutorials