专栏名称: 机器学习研究会
机器学习研究会是北京大学大数据与机器学习创新中心旗下的学生组织,旨在构建一个机器学习从事者交流的平台。除了及时分享领域资讯外,协会还会举办各种业界巨头/学术神牛讲座、学术大牛沙龙分享会、real data 创新竞赛等活动。
目录
相关文章推荐
新智元  ·  DeepSeek-R1-Zero不存在顿悟时 ... ·  3 天前  
人工智能那点事  ·  刷医保可买华为手表?有药店超500人预约!各方回应 ·  3 天前  
爱可可-爱生活  ·  【[11.1k星]Subfinder:快速被 ... ·  3 天前  
51好读  ›  专栏  ›  机器学习研究会

【学习】分析momentum在梯度下降中的作用的好文章

机器学习研究会  · 公众号  · AI  · 2017-04-06 19:11

正文



点击上方 “机器学习研究会” 可以订阅哦
摘要

转自:孙明明_SmarterChina

Here’s a popular s tory about momentum [1, 2, 3] : gradient descent is a man walking down a hill. He follows the steepest path downwards; his progress is slow, but steady. Momentum is a heavy ball rolling down the same hill. The added inertia acts both as a smoother and an accelerator, dampening oscillations and causing us to barrel through narrow valleys, small humps and local minima.


This standard story isn’t wrong, but it fails to explain many important behaviors of momentum. In fact, momentum can be understood far more precisely if we study it on the right model.


One nice model is the convex quadratic. This model is rich enough to reproduce momentum’s local dynamics in real problems, and yet simple enough to be understood in closed form. This balance gives us powerful traction for understanding this algorithm.


We begin with gradient descent. The algorithm has many virtues, but speed is not one of them. It is simple -- when optimizing a smooth function f , we make a small step in t he gradient

For a step-size small enough, gradient descent makes a monotonic improvement at every iteration. It always converges, albeit to a local minimum. And under a few weak curvature conditions it can even get there at an exponential rate.


But the exponential decrease, though appealing in theory, can often be infuriatingly small. Things often begin quite well -- with an impressive, almost immediate decrease in the loss. But as the iterations progress, things start to slow down. You start to get a nagging feeling you're not making as much progress as you should be. What has gone wrong?


链接:







请到「今天看啥」查看全文