Studio 415 ml meetup.
The bash script grabDependencies.sh will download data and clone several repositories that contain examples and dependencies. It will create a directory data and subdirectories to contain the MNIST, CIFAR-10 and CIFAR-100 datasets.
The python scripts in the code directory expect these directories to exist, as well as an additional directory, data/cifarKaggle, which must be populated by hand using the data from Kaggle.com. Expected files and subdirectories are
data/cifarKaggle/trainLabels.csvfrom http://www.kaggle.com/c/cifar-10/download/trainLabels.csvdata/cifarKaggle/trainthe unzipped contents of http://www.kaggle.com/c/cifar-10/download/train.7zdata/cifarKaggle/testthe unzipped contents of http://www.kaggle.com/c/cifar-10/download/test.7z
The python scripts in the code directory will try to identify the location of the cifar-ten directory. First they will check if there is an environment variable CIFAR10_HOME, in which case they will assume that is the location of the cifar-ten directory. If that environment variable is not set, then it will look for python scripts in directory $HOME/cifar-ten/code/ and data files in subdirectories of $HOME/cifar-ten/data.
Currently the python scripts in code require packages numpy, scipy, Theano and PyWavelets. To install these packages system-wide, invoke
sudo pip install Theanosudo pip install PyWavelets
If local installation is sufficient or preferred, invoke pip with the --user flag.