Update readme

lightvector · Jun 2, 2018 · 82f445e · 82f445e
1 parent 535280b
commit 82f445e
Showing 1 changed file with 14 additions and 7 deletions.
diff --git a/README.md b/README.md
@@ -1,8 +1,20 @@
 # Go Neural Net Sandbox
 
-This repo is currently a sandbox for personal experimentation in neural net training in Go. I haven't put much work to make the training pipeline usable by others, since this is foremost a personal sandbox, but if you're interested, the rough summary is:
+This repo is currently a sandbox for personal experimentation in neural net training in Go. I've only put a little work into making this repo usable by others, since this is foremost a personal sandbox, but if you're interested, here's how to get started:
 
-   * You must have HDF5 installed for C++ (https://support.hdfgroup.org/HDF5/release/obtainsrc.html), as well as Tensorflow installed for Python 3.
+   * You must have [Python3](https://www.python.org/) and [Tensorflow](https://www.tensorflow.org/install/) installed.
+   * Either download a pre-trained model (https://github.com/lightvector/GoNN/releases) or train one yourself (see below).
+   * Do things with the trained model
+      * Run a GTP engine using the model to make moves directly: `./play.py -model <MODELS_DIRECTORY>/model<EPOCH>`
+      * Dump one of the raw weight matrices: `./visualize.py -model-file <MODELS_DIRECTORY>/model<EPOCH> -dump p2/w:0`
+      * Directly run a model on a position from an sgf file: `./eval_sgf.py -model-file <MODELS_DIRECTORY>/model<EPOCH> -sgf SGFFILE -move TURNNUMBER`
+
+See LICENSE for software license. License aside, informally, if do you successfully use any of the code or any wacky ideas about neural net structure explored in this repo in your own neural nets or to run any of your own experiments, I would to love hear about it and/or might also appreciate a casual acknowledgement where appropriate. Yay.
+
+### Training a Model
+
+If you'd like to train your own model and/or experiment with the architectures in this repo, here are the steps.
+   * You must have HDF5 installed for C++ (https://support.hdfgroup.org/HDF5/release/obtainsrc.html).
    * Training consists of converting .sgf files into training rows written in HDF5 format, then reading that HDF5 file in Python to train using numpy and h5py to feed them to Tensorflow.
    * The utility that converts .sgf to .h5 is written in C++. Compile it using CMake and make in writedata, which expects you to have h5c++
      available. (on windows you can also use the deprecated compile.sh):
@@ -16,11 +28,6 @@ This repo is currently a sandbox for personal experimentation in neural net trai
       * By default, it considers 1 million samples to be one epoch. If you want to see initial results more quickly you can pass `-fast-factor N` to divide this and a few other things by N, and `-validation-prop P` to only use P proportion of the validation set, but you will probably want to manually modify train.py (see `TRAINING PARAMETERS` section) as well as model.py (to try a much smaller neural net first).
       * Depending on how much training data you have and how long you're willing to train for, you probably want to edit the `knots` in train.py that determine the learning rate schedule.
       * If you getting errors due to running out of GPU memory, you probably want to also edit train.py to shrink the `batch_size` and/or the `validation_batch_size`.
-   * Do things with the trained model
-      * Run a GTP engine using the model to make moves directly: `./play.py -model <MODELS_DIRECTORY>/model<EPOCH>`
-      * Dump one of the raw weight matrices: `./visualize.py -model-file <MODELS_DIRECTORY>/model<EPOCH> -dump p2/w:0`
-
-See LICENSE for software license. License aside, informally, if do you successfully use any of the code or any wacky ideas about neural net structure explored in this repo in your own neural nets or to run any of your own experiments, I would to love hear about it and/or might also appreciate a casual acknowledgement where appropriate. Yay.
 
 # Experimental Notes
 You can see the implementations of the relevant neural net structures in "model.py", although I may adapt and change them as time goes on.