-
Notifications
You must be signed in to change notification settings - Fork 261
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tutorial: Integration with TF-Agents RL Framework #133
Labels
Comments
Tutorial code is a combination of TF-agent
|
@JacobHanouna, Would you like to prepare pull request with tutorial notebook? |
I can make a notebook tutorial, but because there is some code that need be change for it to work. I thought it would be an issue, this is why I posted it here. |
Ok, I see. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
BTgym have two main sections, the Gym framework and the RL algorithm framework.
The RL part is tailored to the unique gym requirements of BTgym, but as new research in the field is emerging there will be a benefit in exploring new algorithms that aren't implemented by this project.
The following tutorial is my own attempt of testing the integration between the Gym part of BTgym with an external RL framework. This tutorial is purely a Proof-of-Concept for testing this integration.
I took the most basic tutorial from the TF-Agent project - dqn tutorial
and tried to run it with BTgym.
A few notes:
DQN network has a simple implementation, it expect a simple array for action space and a simple array for observation space.
To resolve the action space issue, I have submitted a PR to TF-Agent that got rejected for basically being an overkill to the specification of the network. So you will need to manually apply those changes from here
To resolve the observation space you can manually collapse the dictionary or just work with 'external' as this is purely a proof of concept tutorial.
I had changed this line
py_environment.py
to get only the external tag from the dictionaryThe text was updated successfully, but these errors were encountered: