Skip to content

Latest commit

 

History

History
16 lines (9 loc) · 561 Bytes

File metadata and controls

16 lines (9 loc) · 561 Bytes

This is a modular architecture for model based reinforcement learning using search.

The components are separated, facilitating the creation of the agents and the extension of the existing components

Architecture

The current implementation learns the hidden-states for planning and an action-mask.

MORE ABOUT THIS: PAPER

to try it, run test.py: it will ask you to choose from different components and then run.