This code is a forked version of Sirajology's pong_neural_net_live project. In a live session he built the game Pong from scratch and then built a Deep Q Network (DQN) that improves over time through trial and error. The DQN is a convolutional neural network that takes the raw pixel data and the game score as input. Through reinforcement learning it learns which moves to make in order to play better.
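For illustration, a convolutional Q-network of this kind can be sketched in TensorFlow 1.x roughly as below. The layer sizes, variable names and number of stacked frames are assumptions for this example and do not necessarily match the network defined in RL.py.

    import tensorflow as tf

    def build_q_network(num_actions=3, stacked_frames=4):
        # Input: a stack of 84x84 preprocessed game frames (batch, height, width, frames).
        # The flatten size below assumes exactly this frame size.
        state = tf.placeholder(tf.float32, [None, 84, 84, stacked_frames])

        # Two convolutional layers extract spatial features from the pixels
        conv1_w = tf.Variable(tf.truncated_normal([8, 8, stacked_frames, 32], stddev=0.01))
        conv1_b = tf.Variable(tf.constant(0.01, shape=[32]))
        conv1 = tf.nn.relu(tf.nn.conv2d(state, conv1_w, strides=[1, 4, 4, 1], padding="SAME") + conv1_b)

        conv2_w = tf.Variable(tf.truncated_normal([4, 4, 32, 64], stddev=0.01))
        conv2_b = tf.Variable(tf.constant(0.01, shape=[64]))
        conv2 = tf.nn.relu(tf.nn.conv2d(conv1, conv2_w, strides=[1, 2, 2, 1], padding="SAME") + conv2_b)

        # Flatten and map to one Q-value per paddle action (e.g. up, down, stay)
        flat = tf.reshape(conv2, [-1, 11 * 11 * 64])
        fc_w = tf.Variable(tf.truncated_normal([11 * 11 * 64, num_actions], stddev=0.01))
        fc_b = tf.Variable(tf.constant(0.01, shape=[num_actions]))
        q_values = tf.matmul(flat, fc_w) + fc_b
        return state, q_values

During training, the action with the highest predicted Q-value is chosen (with some random exploration), and the game score is used as the reward signal that the network is trained against.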
Because the code from the original project did not work, I fixed several bugs by combining Sirajology's and asrivat1's code.
List of fixed bugs:
- The ball flew through the left paddle - fixed by updating the collision detection
- The agent never got positive rewards - fixed by giving the agent a reward for defending the ball
- The session was saved but never loaded - fixed by adding a loading routine (see the sketch after this list)
- The agent didn't get better over time - fixed by updating the network topology and cost function
- The agent copied the movement of the right paddle (overfitting) - fixed by adding random movement to that paddle
- The agent got multiple rewards when the ball flew through the left paddle from behind - fixed so that only one reward is given per event
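As an illustration of the loading fix mentioned above, a checkpoint-restoring routine in TensorFlow 1.x can look roughly like this. The function and variable names are assumptions for this sketch, not necessarily the ones used in RL.py.

    import tensorflow as tf

    def load_or_init(sess, saver, checkpoint_dir="saved_networks"):
        """Restore the latest saved network if one exists, otherwise initialize fresh weights."""
        checkpoint = tf.train.get_checkpoint_state(checkpoint_dir)
        if checkpoint and checkpoint.model_checkpoint_path:
            saver.restore(sess, checkpoint.model_checkpoint_path)
            print("Loaded network weights from", checkpoint.model_checkpoint_path)
        else:
            sess.run(tf.global_variables_initializer())
            print("No saved network found, starting with fresh weights")

    # Typical usage once the graph has been built:
    #   sess = tf.Session()
    #   saver = tf.train.Saver()
    #   load_or_init(sess, saver)

This is why the saved_networks/ directory from the file structure below has to exist: checkpoints are written there during training and restored from there on the next run.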
Improvements:
- More detailed and cleaner console output
- Console output can be written to a log file (see the sketch after this list)
- Cleaned up and shortened pong.py
- Added two finishing conditions and a cleanup when exiting
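A minimal sketch of how console output can be mirrored into a file with Python's logging module is shown below; the file name and message format are only examples, and RL.py may implement this differently.

    import logging

    # Write everything that goes to the console into logs/training.log as well.
    # The logs/ directory from the file structure below must already exist.
    logging.basicConfig(
        level=logging.INFO,
        format="%(asctime)s %(message)s",
        handlers=[
            logging.StreamHandler(),                   # console output
            logging.FileHandler("logs/training.log"),  # same output in the log file
        ],
    )

    logging.info("Timestep %d  reward %.2f", 1000, 0.5)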
ToDo:
- Automatically detect available RAM, so the code doesn't use more than is available
- Rework messy bugfixes
Now everything works as it should.
Dependencies:
Use pip to install the dependencies. For tensorflow and cv2, follow the installation instructions on their project websites.
Make sure the project has the following file structure:
DQN-pong
|-- logs/
|-- saved_networks/
|-- RL.py
|-- pong.py
if "logs/" or "saved_networks/" are missing, create them by yourself.
Run it like this in terminal. It will take about 1,000,000 Timesteps until the ai plays almost perfect.
python RL.py