Posts
All the articles I've posted.
-
MIT 6.S978 Deep Generative Models(一):从 AutoEncoder 到 Variational AutoEncoder
从第一性原理理解 VAE:潜变量、ELBO、重参数化与代码实现。
-
强化学习中的数学原理(五):策略梯度与 Actor-Critic
Mathematical Principles in Reinforcement Learning summary part 5 - Policy Gradient Methods and Actor-Critic Architecture
-
强化学习中的数学原理(四):时序差分学习与价值函数近似
Mathematical Principles in Reinforcement Learning summary part 4 - Temporal-Difference Learning (TD, SARSA, Q-Learning) and Value Function Approximation
-
强化学习中的数学原理(三):蒙特卡洛方法与随机近似
Mathematical Principles in Reinforcement Learning summary part 3 - Monte Carlo methods and Stochastic Approximation (SGD)