Google DeepMind的David Silver的强化学习课程讲义,包括Markov Decision Processes、Planning by Dynamic Programming、Model-Free Prediction、Model-Free Control、Function Approximation、Policy Gradient Methods、Integrating Learning and Planning、Exploration and Exploitation以及游戏案例分析。视频:https://www.youtube.com/playlist?list=PL5X3mDkKaJrL42i_jhE4N-p6E2Ol62Ofa
2019-12-21 19:50:53
20.35MB
强化学习
1