algorithm-learning

A Julia reimplementation of the paper "Learning Simple Algorithms from Examples".

TODO

  • Revisit grid tasks data generation
  • Get supervised grid tasks working
  • GRU controller
  • Model saving
  • Visualization/demo (has some harmless bugs; still a work in progress)
  • Q-learning (work in progress)

Q-Learning

  • Implement the make_batches function
  • Implement the loss/train function
  • Debug run_episodes! and other things
  • Correct the objective and training