
Behaviors of Atari envs have changed by atari-py>=0.2 #1777

Closed
muupan opened this issue Jan 5, 2020 · 8 comments

Comments

@muupan

muupan commented Jan 5, 2020

The behaviors of Atari envs seem to be affected by the version of atari-py even though the env id is the same.

import gym

# Run one episode of Ms. Pacman, repeating the same action every step,
# and report the episode length and return.
env = gym.make('MsPacmanNoFrameskip-v4')
env.seed(0)
env.reset()
done = False
t = 0  # number of steps
R = 0  # total reward
while not done:
    _, r, done, _ = env.step(1)
    t += 1
    R += r
print(t, R)

Below is the output of this code for each pair of gym and atari-py versions. It seems like atari-py is the cause of the difference, but since gym has required atari-py~=0.2.0 in setup.py since 1.3.0 (#1535), gym is responsible for the version of atari-py that gets installed. That is why I opened this issue here rather than in https://github.com/openai/atari-py.

  • gym==0.15.4 atari-py==0.2.6: 2009 90.0
  • gym==0.15.4 atari-py==0.2.0: 2009 90.0
  • gym==0.15.4 atari-py==0.1.15: 1329 90.0
  • gym==0.15.4 atari-py==0.1.4: 1329 90.0
  • gym==0.12.6 atari-py==0.2.6: 2009 90.0
  • gym==0.12.6 atari-py==0.2.0: 2009 90.0
  • gym==0.12.6 atari-py==0.1.15: 1329 90.0
  • gym==0.12.6 atari-py==0.1.4: 1329 90.0
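
As a side note, a minimal sketch of how to record the exact installed versions next to results like the list above, assuming Python 3.8+ so that importlib.metadata is available (pkg_resources works on older Pythons):

from importlib.metadata import version  # Python 3.8+

# Log the exact gym and atari-py builds so that runs with different
# atari-py versions can be told apart later.
print('gym', version('gym'))
print('atari-py', version('atari-py'))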

I also confirmed such a difference for ChopperCommandNoFrameskip-v4.

I am concerned that these differences might significantly affect the evaluation of RL algorithms. Has anyone investigated the effect?

@muupan
Author

muupan commented Jan 6, 2020

Below are the results obtained by my PPO implementation on Atari, using 3 different seeds for each configuration. gym==0.12.1 was used. It seems that the atari-py version actually affects the performance for some games. I'm not sure what change in atari-py or ALE caused this, though.

[Three attached plots of the PPO results]

@kngwyu
Contributor

kngwyu commented Jan 6, 2020

openai/atari-py#49 looks like the biggest change.

@christopherhesse
Contributor

The changes were meant specifically to not change the behavior of the environments: openai/atari-py#49 (comment)

@JesseFarebro any idea what might be going on here?

@christopherhesse
Contributor

Thanks for investigating this @muupan

@JesseFarebro
Contributor

Hi @muupan @christopherhesse,

Thanks for bringing this to my attention. I'll investigate further and let you know what my findings are.

@JesseFarebro
Contributor

JesseFarebro commented Jan 11, 2020

Thanks for the reproduction @muupan.

With regard to Ms. Pacman: essentially, what is happening is that we press the reset button too many times, and this leads to a different starting state in ALE v0.6.0. The following commit introduced this bug: Farama-Foundation/Arcade-Learning-Environment@7bff96b#diff-d9d868097a7403416e6ef352d95dc4feR85. This should have minimal effect on performance comparisons between v0.5.2 and v0.6.*.

Here are two images comparing the first frame in each ALE version:

Ms. Pacman v0.5.2
[first frame, ALE v0.5.2]

Ms. Pacman v0.6.0
[first frame, ALE v0.6.0]
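
A rough way to check this difference programmatically, independent of the screenshots above, is to hash the first frame after reset under each installed atari-py version; a minimal sketch (the env id and seed come from the reproduction script earlier in this thread):

import hashlib

import gym

# Seed the env and hash the very first observation after reset.
# Running this under atari-py 0.1.x and 0.2.x should give different
# digests if the starting state differs between ALE versions.
env = gym.make('MsPacmanNoFrameskip-v4')
env.seed(0)
obs = env.reset()
print(hashlib.md5(obs.tobytes()).hexdigest())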

With regard to Chopper Command: the issue that affects Ms. Pacman also affects Chopper Command. The call to softReset happens in two places: the one linked above and Farama-Foundation/Arcade-Learning-Environment@31d8e17#diff-0ff5bae3de90143156577bc8324e6d27R155.

As for the performance difference in your PPO agent, that doesn't actually strike me as overly surprising. If we assume that these runs weren't completely deterministic (due to a different episode start state as discussed above, or an improper seed), these curves seem within reason for 3 seeds. I looked at the original PPO paper, and their results on Chopper Command show large variance between their 3 runs.
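
One way to check the determinism assumption directly is to roll out the same fixed action sequence twice with the same seed and compare the observations; a minimal sketch (the rollout helper is hypothetical, reusing the env and action from the reproduction script above):

import gym
import numpy as np

def rollout(seed, n_steps=500):
    # Collect observations for a fixed action sequence under a fixed seed.
    env = gym.make('MsPacmanNoFrameskip-v4')
    env.seed(seed)
    frames = [env.reset()]
    for _ in range(n_steps):
        obs, _, done, _ = env.step(1)  # always take the same action
        frames.append(obs)
        if done:
            break
    env.close()
    return frames

a, b = rollout(0), rollout(0)
identical = len(a) == len(b) and all(np.array_equal(x, y) for x, y in zip(a, b))
print('deterministic:', identical)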

I have opened an issue upstream (Farama-Foundation/Arcade-Learning-Environment#291).

Hopefully, this helps clear some things up.

@christopherhesse
Contributor

Thanks for investigating @JesseFarebro!

@JesseFarebro
Contributor

This is being tracked upstream at Farama-Foundation/Arcade-Learning-Environment#291. Feel free to close this.

@jkterry1 closed this as completed Aug 5, 2021