Quantile Regression DQN

Each experiment is run with 3 seeds and trained for 10k environment steps. The QR-DQN hyperparameters are those described in the original paper.

Breakout QR-DQN - single worker

coach -p Atari_QR_DQN -lvl breakout

[Figure: Breakout QR-DQN]

Pong QR-DQN - single worker

coach -p Atari_QR_DQN -lvl pong

[Figure: Pong QR-DQN]
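
Running with several seeds

A minimal sketch of how the three seeds per experiment might be launched for the levels above. It rests on two assumptions that this README does not state: that `coach` accepts a `--seed` flag, and that the seed values are 0, 1 and 2. The helper `build_commands` is hypothetical. It only builds and prints the command lines and never executes them.

```python
import shlex

# Assumption: the README says 3 seeds are used but does not list their values.
SEEDS = [0, 1, 2]


def build_commands(level, preset="Atari_QR_DQN", seeds=SEEDS):
    """Return one `coach` argument list per seed for the given level.

    Assumption: `coach` takes the seed through a `--seed` flag.
    """
    return [
        ["coach", "-p", preset, "-lvl", level, "--seed", str(seed)]
        for seed in seeds
    ]


if __name__ == "__main__":
    for level in ("breakout", "pong"):
        for argv in build_commands(level):
            # Print instead of executing, so the sketch has no side effects.
            print(shlex.join(argv))
```

Each printed line can be pasted into a shell. To launch the runs directly, the argument lists could be passed to `subprocess.run` instead of being printed.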