Skip to content

Batch, Off-policy and Model-free Apprenticeship Learning(LSTD-mu, or BOMAP) with TF1.5

Notifications You must be signed in to change notification settings

jeonggwanlee/LSTD-mu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LSTD-mu

LSTD-mu?

Batch, Off-policy and Model-free Apprenticeship Learning

(Projection method[Abbeel, Ng. 2004] + LSPI + LSTD-mu)

Dependency

Language

  • python3.6

Libraries

  • Tensorflow 1.5.0
  • gym (openai gym)
  • Numpy

Run

python3 bomap_main.py (default)

python3 bomap\_{}\_main.py (under construction)

Detail

(under construction)

Deep Action Network(DAN) for deep basis function features instead of simple basis function

IRL_DAN + Deep Reward Network(DRN) for irl instead of Projection method

About

Batch, Off-policy and Model-free Apprenticeship Learning(LSTD-mu, or BOMAP) with TF1.5

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages