Skip to content

litehed/Agent-Implementations-RL

Repository files navigation

Agent Implementations RL

Here I am implementing the agents as I read through them in the Sutton and Bartow book: Reinforcement Learning An Introduction 2nd edition. Unless stated otherwise these are trained to find q* using ε-greedy policy then evaluated using a greedy policy on the q* obtained through training.

Agents Implemented:

Chapter 5 - Monte Carlo Methods

  • On-policy first-visit Monte Carlo
  • Off-policy Monte Carlo

Chapter 6 - Temporal-Difference Learning

  • SARSA(0)
  • Q-Learning
  • Expected SARSA
  • Double Q-Learning

Chapter 7 - n-step Bootstrapping

  • n-step SARSA
  • Off-Policy n-step SARSA
  • n-step Tree Backup

Chapter 8 - Planning and Learning with Tabular Methods

  • Tabular Dyna-Q

Chapter 10 - On-policy Control with Approximation

  • Episodic Semi-Gradient Sarsa

Chapter 12 - Eligibility Traces

  • SARSA(λ)

Testing:

I am testing each of these using different gymnasium environments.

About

I am using this as a way to store my implementations of agents brought up in Reinforcement Learning An Introduction 2nd edition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages