This is a repository for a class project in reinforcement learning.
Contributors:
Zoe Kanavas ([email protected])
Erin Musabandesu ([email protected])
Liam Lynch ([email protected])
UC Davis Google Drive Data Access
- Sample_A (data folder)
- heuristic_info_all_samples.csv
Creates a folder for results (pickled dictionary) and figs based on a trial number. The trial number is also taken as the random seed.
- Episodic semigradient SARSA (Sutton and Barto, pg. 244) - with linear approximation function
- Continuous semigradient SARSA (Sutton and Barto, pg. 251) - with linear approximation function