-
Notifications
You must be signed in to change notification settings - Fork 26
Open
Labels
Description
When downloading corpora from versioned data stores, I would expect to take into account a tag or specific hash of that dataset. That way users are sure if a specific version of audiomate yields an identical corpus to foster reproducibility.
e.g. lets take the esc-50 corpus: the root url downloads directly from master branch
audiomate/audiomate/corpus/io/esc.py
Line 11 in 28696c0
| DOWNLOAD_URL = 'https://github.com/karoldvl/ESC-50/archive/master.zip' |
To improve reproducibility, I suggest that audiomate uses tags where possible (github, zenodo, ...) and furthermore provide a checksum mechanism that verifies a successful download.
This issue is part of a JOSS review openjournals/joss-reviews#2135
Reactions are currently unavailable