Offline Reinforcement Learning at Multiple Frequencies

Code for reproducing the results of Offline RL at Multiple Frequencies (arXiv, website).

Offline data was collected from replay buffers during training with the DAU repository or this repository and can be downloaded here.

This repository builds off of Young Geng's implementation of CQL.

Installation

Install and use the included Ananconda environment

$ conda env create -f environment.yml
$ source activate

Add this repo directory to your PYTHONPATH environment variable.

export PYTHONPATH="$PYTHONPATH:$(pwd)"

Run Experiments

We provide example run scripts for pendulum, door, and kitchen.

For example, to run the adaptive n-step algorithm:

./run_kitchen.sh 120 101 .99 500

To run the naive mixing baseline:

./run_kitchen.sh 0 101 .99 500

The max n-step baseline can be run by setting the all_same_N flag to True and the individual training baselines can be run by commenting out the data loaders.

Experiment Tracking with Weights and Biases

By default, the scripts log to W&B. To log to W&B, set your W&B API key environment variable:

export WANDB_API_KEY='YOUR W&B API KEY HERE'

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
SimpleSAC		SimpleSAC
viskit		viskit
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_scripts.py		data_scripts.py
debug.sh		debug.sh
environment.yml		environment.yml
eval.sh		eval.sh
hyperparam.sh		hyperparam.sh
hyperparam_collect.sh		hyperparam_collect.sh
run.sh		run.sh
run_collect.sh		run_collect.sh
run_door.sh		run_door.sh
run_kitchen.sh		run_kitchen.sh
run_pendulum.sh		run_pendulum.sh
vid_from_npz.py		vid_from_npz.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Offline Reinforcement Learning at Multiple Frequencies

Installation

Run Experiments

Experiment Tracking with Weights and Biases

About

Releases

Packages

Contributors 2

Languages

License

stanford-iris-lab/offline_rl_at_multiple_freqs

Folders and files

Latest commit

History

Repository files navigation

Offline Reinforcement Learning at Multiple Frequencies

Installation

Run Experiments

Experiment Tracking with Weights and Biases

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages