Tag: reinforcement learning
All the articles with the tag "reinforcement learning".
-
Mathematical Principles in Reinforcement Learning (4): Temporal-Difference Learning and Value Function Approximation
Mathematical Principles in Reinforcement Learning summary part 4 - Temporal-Difference Learning (TD, SARSA, Q-Learning) and Value Function Approximation
-
Mathematical Principles in Reinforcement Learning (3): Monte Carlo Methods and Stochastic Approximation
Mathematical Principles in Reinforcement Learning summary part 3 - Monte Carlo methods and Stochastic Approximation (SGD)
-
Mathematical Principles in Reinforcement Learning (2): Bellman Optimality Equation and Iterative Algorithms
Mathematical Principles in Reinforcement Learning summary part 2 - Bellman Optimality Equation and Value/Policy Iteration
-
cs285
cs285 summary