Dependencies:
- Python 3.5
- NumPy version 1.14.5
- TensorFlow version 1.10.5
- MuJoCo version 1.50 and mujoco-py 1.50.1.56
- OpenAI Gym version 0.10.5
- seaborn
- Box2D==2.3.2
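
To confirm that an environment matches the pins above, a small check along the following lines may help. This is only a sketch: the PyPI distribution names in `EXPECTED` are assumptions, the dictionary covers only some of the listed packages, and `installed_version` and `find_mismatches` are hypothetical helper names, not part of the homework code.

```python
# Sketch: compare installed package versions against some of the pins above.
# The distribution names used as keys are assumptions; extend as needed.
EXPECTED = {
    "numpy": "1.14.5",
    "mujoco-py": "1.50.1.56",
    "gym": "0.10.5",
    "Box2D": "2.3.2",
}


def installed_version(name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        from importlib import metadata  # available on Python 3.8+
    except ImportError:
        metadata = None
    if metadata is not None:
        try:
            return metadata.version(name)
        except metadata.PackageNotFoundError:
            return None
    import pkg_resources  # fallback for older interpreters such as Python 3.5
    try:
        return pkg_resources.get_distribution(name).version
    except pkg_resources.DistributionNotFound:
        return None


def find_mismatches(expected, lookup=installed_version):
    """Return {name: (wanted, found)} for every package whose version differs."""
    mismatches = {}
    for name, wanted in expected.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    # Print one line per package that is missing or at a different version.
    for name, (wanted, found) in sorted(find_mismatches(EXPECTED).items()):
        print("{}: expected {}, found {}".format(name, wanted, found or "not installed"))
```

An empty output means every package in `EXPECTED` is installed at the listed version.
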
Before doing anything, first replace gym/envs/box2d/lunar_lander.py with the provided lunar_lander.py file.
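
If it is unclear where gym is installed, the replacement can be scripted. The following is a minimal sketch, assuming the provided lunar_lander.py sits in the current directory and that gym is importable; `replace_lunar_lander` is a hypothetical helper, and the `.orig` backup is an addition of this sketch rather than something the homework asks for.

```python
import os
import shutil


def replace_lunar_lander(provided_path, gym_root, backup=True):
    """Copy provided_path over <gym_root>/envs/box2d/lunar_lander.py.

    gym_root is the directory of the installed gym package. If backup is
    true and a file already exists at the target, it is first saved next to
    it with an added ".orig" suffix. Returns the path that was written.
    """
    target = os.path.join(gym_root, "envs", "box2d", "lunar_lander.py")
    if backup and os.path.exists(target):
        shutil.copy2(target, target + ".orig")
    shutil.copy2(provided_path, target)
    return target


# Example usage (requires gym to be installed):
#   import gym
#   replace_lunar_lander("lunar_lander.py", os.path.dirname(gym.__file__))
```

Deriving the directory from `gym.__file__` avoids guessing the site-packages path, which differs between virtual environments and system installs.
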
The only file that you need to look at is train_pg_f18.py, which you will implement.
See the HW2 PDF for further instructions.
If you get the error AttributeError: module '_Box2D' has no attribute 'RAND_LIMIT_swigconstant', run this:
pip3 install box2d box2d-kengz