Binary file added .DS_Store
Binary file not shown.
File renamed without changes.
Binary file added 2. Multi-Armed Bandits/.DS_Store
Binary file not shown.
Binary file added 2. Multi-Armed Bandits/2. Multi-Armed Bandits.pdf
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
2 changes: 2 additions & 0 deletions README.md
@@ -4,4 +4,6 @@ slides and other materials
| Title | Detail | Author | link |
| ------------------------------------------------------------ | ------------------------------------------------ | ------ | ------------------------------------------------------------ |
| Introduction about RL | An introduction to reinforcement learning, covering its basic elements, a taxonomy of RL methods, and some examples | 李娜 | [slide](https://github.com/ECNUdase/Reinforcement-Learning-2020/blob/master/RL_introduction.pdf) |
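| Chapter 2: Multi-Armed Bandits | Classic solutions to the multi-armed bandit problem, mainly addressing the exploration-exploitation (EE) trade-off in reinforcement learning. | 韩程程 | [slide](https://github.com/ECNUdase/Reinforcement-Learning-2020/blob/master/2. Multi-Armed Bandits/2. Multi-Armed Bandits.pdf) |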
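| Chapter 5: Monte Carlo Methods | Policy evaluation and policy improvement with Monte Carlo methods, including the different approaches used in the on-policy and off-policy settings | 刘婷婷 | [slide](https://github.com/ECNUdase/Reinforcement-Learning-2020/blob/master/5.%20MC%20Learning.pdf) |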