What are Capsule Networks and why do they exist?
The Capsule Network is a new type of neural network architecture conceptualized by Geoffrey Hinton. The motivation behind Capsule Networks is to address some of the shortcomings of Convolutional Neural Networks (ConvNets), which are listed below:
Problem 1: ConvNets are Translation Invariant [1]
What does that even mean? Imagine that we have a model that detects cats. You show it an image of a cat, and it predicts that it’s a cat. You show it the same image, but shifted to the left, and it still predicts that it’s a cat, without providing any additional information about the shift.
Figure 1.0: Translation Invariance
What we want to strive for is translation equivariance. That means when you show it an image of a cat shifted to the right, it predicts that it’s a cat shifted to the right. When you show it the same cat shifted to the left, it predicts that it’s a cat shifted to the left.
Figure 1.1: Translation Equivariance
Why is this a problem? ConvNets are unable to identify the position of one object relative to another; they can only identify whether an object exists in a certain region or not. As a result, they have difficulty correctly identifying objects whose sub-objects hold positional relationships relative to one another.
For example, a bunch of randomly assembled face parts will look like a face to a ConvNet, because all the key features are there, regardless of their arrangement.
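The distinction between invariance and equivariance can be sketched with a toy 1-D example. This is an illustrative sketch, not a real ConvNet: a single activation spike stands in for a detected feature, global max pooling stands in for the invariant readout, and the argmax stands in for a position-aware (equivariant) readout.

```python
import numpy as np

# Toy 1-D "feature map": one activation spike standing in for a detected
# cat feature (illustrative only, not an actual ConvNet activation).
feature_map = np.array([0.0, 0.0, 1.0, 0.0, 0.0, 0.0])
shifted_map = np.roll(feature_map, 2)  # same feature, shifted 2 positions right

# Translation INVARIANCE: a global max pool only reports "the feature
# exists" -- the output is identical for both inputs, so position is lost.
print(feature_map.max(), shifted_map.max())      # 1.0 1.0

# Translation EQUIVARIANCE: the argmax shifts along with the input, so
# the output still encodes WHERE the feature is.
print(feature_map.argmax(), shifted_map.argmax())  # 2 4
```

The pooled values are identical for the original and shifted inputs, while the argmax moves by exactly the amount of the shift, which is the behavior the text describes ConvNets lacking at the object level.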
Link:
https://kndrck.co/posts/capsule_networks_explained/