Commit c7269da

added sections
1 parent c20cf89 commit c7269da

1 file changed

+17
-5
lines changed

README.md

Lines changed: 17 additions & 5 deletions
@@ -6,20 +6,20 @@
 1. **Fundamentals.**
 
 
-* Reinforcement learning
+* Reinforcement Learning
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/1_Basics_1.pdf)]
 [[lecture note](https://github.com/wangshusen/DeepLearning/blob/master/LectureNotes/DRL/DRL.pdf)]
 [[Video (in Chinese)](https://youtu.be/vmkRMvhCW5c)].
 
-* Value-based learning
+* Value-Based Learning
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/1_Basics_2.pdf)]
 [[Video (in Chinese)](https://youtu.be/jflq6vNcZyA)].
 
-* Policy-based learning
+* Policy-Based Learning
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/1_Basics_3.pdf)]
 [[Video (in Chinese)](https://youtu.be/qI0vyfR2_Rc)].
 
-* Actor-critic methods
+* Actor-Critic Methods
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/1_Basics_4.pdf)]
 [[Video (in Chinese)](https://youtu.be/xjd7Jq9wPQY)].
 
@@ -37,6 +37,8 @@
 * Double DQN.
 
 * Dueling DQN.
+
+* Multi-Step Return.
 
 
 
@@ -48,15 +50,25 @@
 * Advantage Actor-Critic (A2C).
 
 * Trust-Region Policy Optimization (TRPO).
+
+* Policy Network + RNNs.
 
 
 4. **Multi-Agent Reinforcement Learning.**
 
-* Basics and challenges
+* Basics and Challenges
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/4_MARL_1.pdf)]
 [[Video (in Chinese)](https://youtu.be/KN-XMQFTD0o)].
 
 * Centralized VS Decentralized
 [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/4_MARL_2.pdf)]
 [[Video (in Chinese)](https://youtu.be/0HV1hsjd1y8)].
 
+
+
+5. **Imitation Learning.**
+
+
+* Inverse Reinforcement Learning.
+
+* Generative Adversarial Imitation Learning (GAIL).
