Stockholm AI RL Group Implementations

In Stockholm AI reinforcement learning(RL) group, we discuss RL algorithms and their applications. In order to understand the algorithms fully, we focus on the theoratical and practical details of each algorithm including classic ones and deep ones. This repo is for our internal communication purpose.

We implemented our algorithms using gym and keras and after which we tested the algorithms on the simple cartpole example. The repo is implemented for people to understand the algorithm rather then receiving a good performance on this specific problem. We try to write the code in a clear way.

Usage example

python cartpole.py

Release History

0.2.2
- Implemented the Proximal Policy Optimization method
0.2.1
- Implemented Monte Carlo method
0.2.0
- Fixed SARSA initialization problem
0.1.1
- Implemented SARSA
0.1.0
- Implemented the tabular Q learning
0.0.1
- Work in progress

LICENCE

MIT Licence

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
gym-bandits @ 9376004		gym-bandits @ 9376004
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
bandits.py		bandits.py
cartpole.py		cartpole.py
monte_carlo_agent.py		monte_carlo_agent.py
ppo_agent.py		ppo_agent.py
random_agent.py		random_agent.py
sarsa_agent.py		sarsa_agent.py
tabular_q_agent.py		tabular_q_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stockholm AI RL Group Implementations

Usage example

Release History

LICENCE

Meta

About

Releases

Packages

Languages

License

usr-lab/stockholm-ai-rl

Folders and files

Latest commit

History

Repository files navigation

Stockholm AI RL Group Implementations

Usage example

Release History

LICENCE

Meta

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages