Issues with NewsVendor env #31

jennafu · 2022-11-24T04:45:55Z

I have been attempting to run the NewsVendor environment, with these specific RL and Env configurations (copied from the example notebooks), and it seems like the reward is stuck at around -20,000, after around 500 iterations.

I was wondering what particular configurations I may need to adjust, to see improvements in the reward?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues with NewsVendor env #31

Issues with NewsVendor env #31

jennafu commented Nov 24, 2022

Issues with NewsVendor env #31

Issues with NewsVendor env #31

Comments

jennafu commented Nov 24, 2022