LSTD-mu

LSTD-mu?

Batch, Off-policy and Model-free Apprenticeship Learning

(Projection method [Abbeel and Ng, 2004] + LSPI + LSTD-mu)
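As a rough illustration of the LSTD-mu component, the sketch below estimates the feature expectations of a policy from a fixed batch of transitions by solving an LSTD-style linear system, with the reward features taking the place of the scalar reward. It is a minimal sketch and not necessarily how lstd_mu.py is written: the function name `lstd_mu`, the helpers `phi`, `psi` and `policy`, and the small ridge term `reg` are all assumptions made for the example.

```python
import numpy as np


def lstd_mu(samples, phi, psi, policy, gamma=0.99, reg=1e-6):
    """Estimate feature expectations with an LSTD-style solve (illustrative).

    samples: list of (s, a, s_next, done) transitions from any behaviour policy
    phi(s, a): state-action features, shape (k,)  (hypothetical helper)
    psi(s):    reward features, shape (p,)        (hypothetical helper)
    policy(s): action chosen by the policy being evaluated
    Returns xi with shape (k, p) so that mu(s, a) ~= xi.T @ phi(s, a).
    """
    s0, a0, _, _ = samples[0]
    k = phi(s0, a0).shape[0]
    p = psi(s0).shape[0]
    A = reg * np.eye(k)          # ridge term keeps A invertible (assumption)
    b = np.zeros((k, p))
    for s, a, s_next, done in samples:
        f = phi(s, a)
        # Off-policy: the next action comes from the evaluated policy,
        # not from whatever action the batch happened to record.
        f_next = np.zeros(k) if done else phi(s_next, policy(s_next))
        A += np.outer(f, f - gamma * f_next)
        b += np.outer(f, psi(s))
    return np.linalg.solve(A, b)
```

Because only stored transitions are used, no simulator or transition model is needed, which is what makes the estimate batch, off-policy and model-free.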

Dependency

Language

Python 3.6

Libraries

TensorFlow 1.5.0
gym (OpenAI Gym)
NumPy

Run

python3 bomap_main.py (default)

python3 bomap_{}_main.py (under construction), where {} is a variant name such as mc, with_dan, or mc_with_dan

Detail

(under construction)

Deep Action Network (DAN): learns deep basis-function features to use in place of the simple basis functions

IRL_DAN + Deep Reward Network (DRN): performs IRL with a learned reward network in place of the projection method
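For reference, the projection step that the DRN variant is meant to replace can be sketched as follows. This follows the update in Abbeel and Ng (2004): the expert's feature expectations are projected onto the line through the previous projection point and the newest policy's feature expectations. The function and argument names are assumptions for illustration, not identifiers taken from this repository.

```python
import numpy as np


def projection_step(mu_expert, mu_bar_prev, mu_new):
    """One projection update (illustrative sketch).

    Returns (mu_bar, w, t): the updated projection point, the reward weights
    w = mu_expert - mu_bar, and the margin t = ||w||_2.
    """
    mu_expert, mu_bar_prev, mu_new = (
        np.asarray(v, dtype=float) for v in (mu_expert, mu_bar_prev, mu_new)
    )
    d = mu_new - mu_bar_prev
    denom = d @ d
    if denom == 0.0:
        # The new policy adds no direction to project onto.
        mu_bar = mu_bar_prev
    else:
        step = (d @ (mu_expert - mu_bar_prev)) / denom
        mu_bar = mu_bar_prev + step * d
    w = mu_expert - mu_bar
    return mu_bar, w, float(np.linalg.norm(w))
```

In the full loop, `w` defines the reward used to train the next policy (here via LSPI), and iteration stops once `t` falls below a chosen tolerance.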

Folders and files

legacy_folder
utils
.gitignore
LSPI&BOMAP_final_meeting.pdf
LSPI&BOMAP_first_meeting.pdf
LSPI&BOMAP_initiative.pdf
README.md
bomap_main.py
bomap_mc_main.py
bomap_mc_with_dan_main.py
bomap_with_dan_main.py
dandqn_main.py
dandrndqn_main.py
deep_action_network.py
deep_cartpole.py
deep_q_network.py
deep_q_network_without_drn.py
deep_reward_network.py
irl_test.py
lspi.py
lstd_mu.py
policy.py
rbf.py
record.py
replay_memory.py
reward_basis.py
setup.cfg
test.py
tf_utils.py

jeonggwanlee/LSTD-mu
