add SIL, fix PG loss bug, add dueling networks
This release adds some new implementations, and fixes some bugs from first benchmark runs.
Implementations
#127 Self-Imitation Learning
#128 Checkpointing for saving models
#129 Dueling Networks
Bug Fixes
#132 GPU test-run fixes
#133 fix ActorCritic family loss compute getting detached, and linux plotting issues, add SHA to generated specs