Skip to content

act

Pre-release
Pre-release
Compare
Choose a tag to compare
@cpnota cpnota released this 07 Jun 22:20
· 215 commits to master since this release
9368f98

The release contains a bunch of changes under the hood. The agent API was simplified down to a single method, action = agent.act(state, reward). The accompany this change, State was added as a first class object. Terminal states now have the state.mask set to 0, whereas before terminal states were represented by None.

Another major addition is slurm support. This is in particular to aid in running on gypsum. The SlurmExperiment API handles the creation of the appropriate .sh files, output, etc., so experiments can be run on slurm by writing a single python script! No more writing .sh files by hand! Examples can be found in the demos folder.

There were a few other minor changes as well.

Change log:

  • Simplified agent API to only include act #56
  • Added State object #51
  • Added SlurmExperiment for running on gypsum #53
  • Updated the local and release scripts, and added slurm demos #54
  • Tweaked parameter order in replay buffers #59
  • Improved shared feature handling #63
  • Made write_loss togglable #64
  • Tweaked default hyperparameters