🧠 Reinforcement Learning — Concepts and Implementations from Sutton & Barto.

This repository contains implementations of foundational algorithms and experiments from “Reinforcement Learning: An Introduction” by Richard S. Sutton and Andrew G. Barto, recreated and explored in Jupyter Notebooks for educational purposes.

The goal of this project is to understand reinforcement learning deeply through code, progressing from basic value estimation to more advanced policy-based methods.

📁 Project Structure

| File | Description |
| --- | --- |
| `rl.ipynb` | Core reinforcement learning implementations, covering basic algorithms such as Monte Carlo methods, Temporal Difference (TD) learning, and tabular Q-learning. |
| `rl2.ipynb` | Extended experiments exploring policy gradients, actor-critic methods, and environment simulations using OpenAI Gym. |

🧩 Topics Covered

Markov Decision Processes (MDPs)
Monte Carlo Prediction & Control
Temporal Difference Learning (TD(0), SARSA, Q-learning)
Exploration vs. Exploitation (ε-greedy, Softmax policies)
Policy Gradient Methods (REINFORCE)
Actor-Critic Architectures
Value Function Approximation
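
To give a flavor of the tabular methods listed above, here is a minimal sketch of Q-learning with an ε-greedy policy. It is not taken from the notebooks: the five-state chain environment, the `step` and `epsilon_greedy` helpers, and all hyperparameter values are assumptions chosen for illustration, and only the Python standard library is used.

```python
import random

N_STATES = 5        # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (-1, +1)  # action 0 moves left, action 1 moves right

def step(state, action_idx):
    """Deterministic toy transition: reward 1.0 only on reaching the last state."""
    next_state = min(max(state + ACTIONS[action_idx], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

def epsilon_greedy(q_row, epsilon, rng):
    """Explore with probability epsilon, otherwise act greedily (random tie-break)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    return rng.choice([a for a, v in enumerate(q_row) if v == best])

def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on the toy chain; returns the learned Q-table."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = epsilon_greedy(q[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Off-policy TD target: bootstrap from the best next action,
            # with no bootstrapping from the terminal state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

q = q_learning()
# Greedy action for each non-terminal state (1 means "move right").
greedy_policy = [row.index(max(row)) for row in q[:-1]]
print(greedy_policy)
```

Swapping `max(q[next_state])` for the value of the action actually taken in the next state would turn this update into SARSA, the on-policy counterpart.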

⚙️ How to Run it

Install the required libraries:

pip install numpy matplotlib gym torch jupyter
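
After installing, a quick sanity check like the one below can confirm that the packages are visible to the Python interpreter you plan to run the notebooks with. This is only a sketch: the `missing_packages` helper is hypothetical, and it assumes each package's import name matches its pip name.

```python
import importlib.util

# Import names are assumed to match the pip package names listed above.
REQUIRED = ("numpy", "matplotlib", "gym", "torch", "jupyter")

def missing_packages(names=REQUIRED):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]

missing = missing_packages()
if missing:
    print("Missing: " + ", ".join(missing))
else:
    print("All required packages found.")
```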
