diff --git a/Notes_CN/DRL.pdf b/Notes_CN/DRL.pdf index 58215ee..66f476f 100644 Binary files a/Notes_CN/DRL.pdf and b/Notes_CN/DRL.pdf differ diff --git a/Notes_CN/README.md b/Notes_CN/README.md new file mode 100644 index 0000000..b251ebe --- /dev/null +++ b/Notes_CN/README.md @@ -0,0 +1,19 @@ +# Deep Reinforcement Learning Book In Chinese + +中文图书[人民邮电出版社](https://www.ituring.com.cn/book/2982)编辑和出版,全书294页,彩色印刷。草稿仍然可以在我的GitHub上免费下载。正式出版的书经过了作者和编辑的反复修改和校对,并添加了少量新的内容、习题、以及全部习题的答案。本书算法的PyTorch代码在这里 [[链接]](https://github.com/DeepRLChinese/DeepRL-Chinese)。 + +京东:[https://u.jd.com/eLdsveg](https://u.jd.com/eLdsveg) + +当当:[http://product.dangdang.com/29490069.html‍‬⁢‬‍‌⁢⁣](http://product.dangdang.com/29490069.html) + + + +![book1](book1.png) + + +![book2](book2.jpg) + + +![book3](book3.jpg) + + diff --git a/Notes_CN/book1.png b/Notes_CN/book1.png new file mode 100644 index 0000000..9bc6658 Binary files /dev/null and b/Notes_CN/book1.png differ diff --git a/Notes_CN/book2.jpg b/Notes_CN/book2.jpg new file mode 100644 index 0000000..c15becb Binary files /dev/null and b/Notes_CN/book2.jpg differ diff --git a/Notes_CN/book3.jpg b/Notes_CN/book3.jpg new file mode 100644 index 0000000..b4aec6b Binary files /dev/null and b/Notes_CN/book3.jpg differ diff --git a/README.md b/README.md index 84cea7f..161b6af 100644 --- a/README.md +++ b/README.md @@ -90,23 +90,28 @@ 5. **Advanced Topics on Policy-Based Learning.** - * Trust-Region Policy Optimization (TRPO). + * Trust-Region Policy Optimization (TRPO) + [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/5_Policy_1.pdf)] + [[Video (in Chinese)](https://youtu.be/fcSYiyvPjm4)]. - * Policy Network + RNNs. + * Partial Observation and RNNs. 6. **Dealing with Continuous Action Space.** - * Discrete versus Continuous Control. - [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf)] + * Discrete versus Continuous Control + [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf)] + [[Video (in Chinese)](https://youtu.be/rRIjgdxSvg8)]. - * Deterministic Policy Gradient (DPG) for Continuous Control. + * Deterministic Policy Gradient (DPG) for Continuous Control [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_2.pdf)] + [[Video (in Chinese)](https://youtu.be/cmWejKRWLA8)]. - * Stochastic Policy Gradient for Continuous Control. + * Stochastic Policy Gradient for Continuous Control [[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_3.pdf)] + [[Video (in Chinese)](https://youtu.be/McqFyl_W5Wc)]. diff --git a/Slides/1_Basics_1.pdf b/Slides/1_Basics_1.pdf index 4297f14..f8eb258 100644 Binary files a/Slides/1_Basics_1.pdf and b/Slides/1_Basics_1.pdf differ diff --git a/Slides/1_Basics_2.pdf b/Slides/1_Basics_2.pdf index 74c1434..5a21cc8 100644 Binary files a/Slides/1_Basics_2.pdf and b/Slides/1_Basics_2.pdf differ diff --git a/Slides/1_Basics_3.pdf b/Slides/1_Basics_3.pdf index 463228c..010ab8f 100644 Binary files a/Slides/1_Basics_3.pdf and b/Slides/1_Basics_3.pdf differ diff --git a/Slides/1_Basics_4.pdf b/Slides/1_Basics_4.pdf index 4358964..54cc4b6 100644 Binary files a/Slides/1_Basics_4.pdf and b/Slides/1_Basics_4.pdf differ diff --git a/Slides/5_Policy_1.pdf b/Slides/5_Policy_1.pdf new file mode 100644 index 0000000..88cc294 Binary files /dev/null and b/Slides/5_Policy_1.pdf differ