Version v0.25.0 release #2949

Merged
merged 28 commits into from
Jul 13, 2022
Conversation

@pseudo-rnd-thoughts commented Jul 5, 2022

Release notes

API Changes

  • Step - A majority of deep reinforcement learning algorithm implementations are incorrect due to an important difference between theory and practice, as done is not equivalent to termination. As a result, we have modified the step function to return five values: obs, reward, termination, truncation, info. The full theoretical and practical reasons for these changes (along with example code changes) will be explained in a soon-to-be-released blog post. The aim is for the change to be backward compatible (for now); please report any issues on GitHub or the Discord. @arjun-kg
  • Render - The render API is changed such that the render mode has to be specified during gym.make with the keyword render_mode, after which the render mode is fixed. For further details see https://younis.dev/blog/2022/render-api/ and Render API #2671. This comes with the following additional changes:
    • with render_mode="human", you don't need to call .render(); rendering happens automatically on env.step()
    • with render_mode="rgb_array", .render() pops the list of frames rendered since the last .reset()
    • with render_mode="single_rgb_array", .render() returns a single frame, like before.
  • Space.sample(mask=...) allows a mask when sampling actions, to enable or disable certain actions from being randomly sampled. We recommend developers add the mask to the info dictionary returned by reset(return_info=True) and step. See Added Action masking for Space.sample() #2906 for example implementations of the masks for the individual spaces. We have added an example version of this in the taxi environment. @pseudo-rnd-thoughts
  • Add Graph for environments that use graph style observation or action spaces. Currently, the node and edge spaces can only be Box or Discrete spaces. @jjshoots
  • Add Text space for reinforcement learning that involves communication between agents with dynamic-length messages (otherwise MultiDiscrete can be used). @ryanrudes @pseudo-rnd-thoughts
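The step change above can be illustrated with a toy environment. This is a minimal sketch, not Gym code: ToyEnv is a hypothetical stand-in that returns the new five-value tuple, with a task-level end state producing termination and a time limit producing truncation.

```python
# ToyEnv is a hypothetical stand-in for a Gym environment, illustrating the
# new five-value step signature: obs, reward, termination, truncation, info.
class ToyEnv:
    def __init__(self, max_steps=5):
        self.max_steps = max_steps  # external time limit -> truncation
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # observation

    def step(self, action):
        self.t += 1
        terminated = action == 1              # task-level terminal state
        truncated = self.t >= self.max_steps  # time limit reached
        return self.t, 1.0, terminated, truncated, {}

env = ToyEnv()
obs = env.reset()
done = False
while not done:
    obs, reward, terminated, truncated, info = env.step(0)
    # The rollout loop ends on either signal, but a learning algorithm
    # should only zero its bootstrap value target on true termination;
    # on truncation the episode ended for a reason external to the task.
    done = terminated or truncated
print(obs, terminated, truncated)  # 5 False True
```

The key point is that the old single done flag conflated these two cases, which is why many implementations bootstrapped incorrectly at time limits.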

Bug fixes

  • Fixed car racing termination, where, if the agent finished the final lap, the environment ended through truncation rather than termination. This required a version bump of Car racing to v2 and removed Car racing discrete in favour of gym.make("CarRacing-v2", continuous=False). @araffin
  • In v0.24.0, opencv-python was an accidental requirement for the project. This has been reverted. @KexianShen @pseudo-rnd-thoughts
  • Updated utils.play such that if the environment specifies keys_to_action, the function will automatically use that data. @Markus28
  • When rendering the blackjack environment, fixed a bug where rendering would change the dealer's top card. @balisujohn
  • Updated the mujoco docstring to reflect changes that were accidentally overwritten. @Markus28

Misc

  • The whole project is partially type hinted and checked using pyright (none of the project files are ignored by the type checker). @RedTachyon @pseudo-rnd-thoughts (Future work will add strict type hinting to the core API.)
  • Action masking added to the taxi environment (no version bump due to being backwards compatible) @pseudo-rnd-thoughts
  • The Box space shape inference now allows scalar high and low values to be automatically expanded to shape (1,). Minor changes to identifying scalars. @pseudo-rnd-thoughts
  • Added options support in the classic control environments to modify the bounds on the initial random state of the environment. @psc-g
  • The RecordVideo wrapper is being deprecated, with no support for TextEncoder under the new render API. The plan is to replace RecordVideo with a single function that receives a list of frames from an environment and automatically renders them as a video using MoviePy. @johnMinelli
  • The gym py.Dockerfile image is reduced from 2 GB to 1.5 GB through a number of optimisations. @TheDen
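The action-masking items above (Space.sample(mask=...) and the taxi environment) can be sketched as follows. This is a plain NumPy re-implementation of the idea for a discrete space, not Gym's actual code; the function name and the fallback behaviour for an all-zero mask are illustrative assumptions.

```python
import numpy as np

# Sketch of masked discrete sampling in the spirit of Space.sample(mask=...):
# mask is a binary array over the n actions, where mask[i] == 1 means
# action i may be sampled and mask[i] == 0 means it may not.
def masked_sample(n, mask, rng=None):
    rng = rng or np.random.default_rng()
    valid = np.flatnonzero(mask)  # indices of allowed actions
    if valid.size == 0:
        # One reasonable fallback when no action is valid: return a fixed
        # default action (assumption, not necessarily Gym's behaviour).
        return 0
    return int(rng.choice(valid))

mask = np.array([0, 1, 0, 1], dtype=np.int8)
action = masked_sample(4, mask)
print(action)  # always 1 or 3: masked-out actions are never sampled
```

Returning the mask through the info dictionary, as the notes recommend, lets an agent both restrict its own action selection and sample valid random actions during exploration.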

@jkterry1 jkterry1 merged commit aeda7eb into openai:master Jul 13, 2022