-
Notifications
You must be signed in to change notification settings - Fork 260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compatible to OpenAI Gym? #4
Comments
We did implement OpenAI Gym compatibility, so you could train the agent on your own machine. But never released it, because the challenge was that we want that code to then be submitted and evaluated in an automated way, so that it could be considered for the leaderboard. There's ways to do this and it's something I'm hoping to do in 2019. Help would be appreciated, especially in ideas of how the full pipeline can be set up. Alternatively, we're considered a totally new Deep RL competition that from the beginning is designed to allow for both in-browser and offline training. I see us doing the latter, and continuing to use DeepTraffic as an accessible education tool. |
I see! Thanks for the clarification! |
@AI-Guru JFYI, I've implemented more or less accurate Gym version of deeptraffic environment. |
Thank you so very much! |
@jackft I have couple of questions about implementation, it would be nice to have answers without js-code reverse engineering :) First one: how input to the network is formed? I see input shape of
But how observations are flattened into the vector and in which order temporal history is appended into final observation is not clear. Ideally, it would be description like this:
Second question: how occupancy is calculated? I guess every cell has height of 10 pixels, but when cell is considered occupied? If single pixel is occupied or if more than half of the cell is overlapped by car? Third question: car dynamics. How speed is being calculated? Does car changes its effective speed immediately as car in front of it changes lane or it happened smoothly? How acceleration/braking effect speed? I guess, it could be lots of questions, but I'm trying to build an accurate python version of the environment, which, I believe, could be useful in future launches of competition. On my side, I can make a promise to opensource it for everyone's benefits, now or later :). |
@Shmuma Second, occupancy is calculated by looping through all the cars and adding them to the occupancy grid. We map the car coordinates to cell coordinates and then take the floor of the cell coordinates: the result is a pair of indices. Third, there are several parts to this question. |
@shumna, any news? :D |
@AI-Guru Yep, sorry for delay :) There is my repo with gym-compatible DeepTraffic environment: https://github.com/Shmuma/deep-traffic-2019 It includes environment with tests, training and playing code and not fully finished pytorch -> js converter of the trained model. This project is not finished (sadly), I was distracted by other things. The next step should be finishing JS export utility and make sure gym environment has close dynamics to the native code. |
Namaste!
Great work! I really like it!
Question: Is DeepTraffic compatible to OpenAI Gym? I remember Nvidia writing about it in a 2017 blog-article.
https://blogs.nvidia.com/blog/2017/07/07/deeptraffic-how-an-mit-simulation-game-uses-deep-learning-to-reduce-gridlock/
Best,
Tristan
The text was updated successfully, but these errors were encountered: