专栏名称: 3DCV

关注工业3D视觉、SLAM、自动驾驶技术，更专注3D视觉产业的信息传播和产品价值的创造，深度聚焦于3D视觉传感器、SLAM产品，使行业产品快速连接消费者。

三维点云配准 — ICP 算法原理及推导

3DCV · 公众号 · · 2025-02-17 11:00

正文

点击下方卡片，关注 「3DCV」 公众号
选择星标，干货第一时间送达

来源：https://yilingui.xyz/2019/11/20/191120_point_cloud_registration_icp/

编辑：3DCV

扫描下方二维码，加入 「3D视觉从入门到精通」知识星球 ( 点开有惊喜 ) ，星球内凝聚了众多3D视觉实战问题，以及各个模块的学习资料：近20门独家秘制视频课程、最新顶会论文、计算机视觉书籍、优质3D视觉算法源码等。想要入门3D视觉、做项目、搞科研，欢迎扫码加入！

1 问题描述

点云配准（Point Cloud Registration）指的是输入两幅点云 $P s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> P s P s$ (source) 和 $P t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> P t P t$ (target) ，输出一个变换 $T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> T T$ 使得 $T (P s) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> T (P s) (adsbygoogle = window.adsbygoogle || []).push({}); T (P s)$ 和 $P t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> P t P t$ 的重合程度尽可能高。变换 $T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> T T$ 可以是刚性的(rigid)，也可以不是，本文下面只考虑刚性变换，即变换只包括旋转、平移。

点云配准可以分为粗配准（Coarse Registration）和精配准（Fine Registration）两步。粗配准指的是在两幅点云之间的变换完全未知的情况下进行较为粗糙的配准，目的主要是为精配准提供较好的变换初值；精配准则是给定一个初始变换，进一步优化得到更精确的变换。

目前应用最广泛的点云精配准算法是迭代最近点算法（Iterative Closest Point, ICP）及各种变种 ICP 算法。

2 算法描述

对于 $T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> T T$ 是刚性变换的情形，点云配准问题可以描述为：

R ∗ , t ∗ = arg min R , t 1 | P s | | P s | ∑ i = 1 | | p i t − ( R ⋅ p i s + t ) | | 2 R ∗ , t ∗ = arg ⁡ min R , t ⁡ 1 | P s | ∑ i = 1 | P s | | | p t i − ( R ⋅ p s i + t ) | | 2

这里 $p s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> p (adsbygoogle = window.adsbygoogle || []).push({}); s p s$ 和 $p t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> p t p t$ 是源点云和目标点云中的一一对应点。

ICP 算法的直观想法如下：

如果我们知道两幅点云上点的对应关系，那么我们可以用 Least Squares 来求解 R, t 参数；
怎么知道点的对应关系呢？如果我们已经知道了一个大概靠谱的 R, t 参数，那么我们可以通过贪心的方式找两幅点云上点的对应关系（直接找距离最近的点作为对应点）。

ICP 算法实际上就是交替进行上述两个步骤，迭代进行计算，直到收敛。

ICP 一般算法流程为：

点云预处理

滤波、清理数据等

匹配

应用上一步求解出的变换，找最近点

加权

调整一些对应点对的权重

剔除不合理的对应点对

计算 loss

最小化 loss，求解当前最优变换

回到步骤 2. 进行迭代，直到收敛

整体上来看，ICP 把点云配准问题拆分成了两个子问题：

找最近点
找最优变换

2.1 找最近对应点（Find Closet Point）

利用初始 $R 0 、 t 0 " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R 0 、 t 0 R 0 、 t 0$ 或上一次迭代得到的 $R k - 1 、 t k - 1 " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R k - (adsbygoogle = window.adsbygoogle || []).push({}); 1 、 t k - 1 R k - 1 、 t k - 1$ 对初始点云进行变换，得到一个临时的变换点云，然后用这个点云和目标点云进行比较，找出源点云中每一个点在目标点云中的最近邻点。

如果直接进行比较找最近邻点，需要进行两重循环，计算复杂度为 $O (| P s | \cdot | P t |) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> O (| P s | \cdot | P t |) O (| P s | \cdot | P t |)$ ，这一步会比较耗时，常见的加速方法有：

设置距离阈值，当点与点距离小于一定阈值就认为找到了对应点，不用遍历完整个点集；
使用 ANN(Approximate Nearest Neighbor) 加速查找，常用的有 KD-tree；KD-tree 建树的计算复杂度为 O(N log(N)) ，查找通常复杂度为 O(log(N)) （最坏情况下 O(N) ）。

2.2 求解最优变换（Find Best Transform）

对于 point-to-point ICP 问题，求最优变换是有闭形式解（closed-form solution）的，可以借助 SVD 分解来计算。

先给出结论，在已知点的对应关系的情况下，设 $p ¯ s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ¯ p s p ¯ s$ ， $p ¯ t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ¯ p t p ¯ t$ 分别表示源点云和目标点云的质心，令 $p ^ s i = p s i − p ¯ s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ^ p i s = p (adsbygoogle = window.adsbygoogle || []).push({}); i s − ¯ p s p ^ s i = p s i − p ¯ s$ ， $p ^ t i = p t i − p ¯ t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ^ p i t = p i t − ¯ (adsbygoogle = window.adsbygoogle || []).push({}); p t p ^ t i = p t i − p ¯ t$ ，令 $H = ∑ i = 1 | P s | p ^ s i p ^ t i T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> H = ∑ | P s | i = 1 ^ p i (adsbygoogle = window.adsbygoogle || []).push({}); s ^ p i t T H = ∑ i = 1 | P s | p ^ s i p ^ t i T$ ，这是一个 3x3 矩阵，对 H 进行 SVD 分解得到 $H = U Σ V T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> H = U Σ V T H = U Σ V T$ ，则 point-to-point ICP 问题最优旋转为：

$R * = V U T " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R * = (adsbygoogle = window.adsbygoogle || []).push({}); V U T R * = V U T$

最优平移为：

$t * = p ¯ t - R * p ¯ s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> t * = ¯ p t - R * ¯ p s t * = p ¯ t - R * p ¯ s (adsbygoogle = window.adsbygoogle || []).push({});$

下面分别给出证明。

2.2.1 计算最优平移

令 $N = | P s | " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> N = | P s | N = | P s |$ ，设 $F (t) = \sum i = 1 N | | (R \cdot p s i + t) - p t i | | 2 " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> F (t) = \sum N i = 1 | (adsbygoogle = window.adsbygoogle || []).push({}); | (R \cdot p i s + t) - p i t | | 2 F (t) = \sum i = 1 N | | (R \cdot p s i + t) - p t i | | 2$ ，对其进行求导，则有：

∂ F ∂ t = N ∑ i = 1 2 ( R ⋅ p i s + t − p i t ) = 2 n t + 2 R N ∑ i = 1 p i s − 2 N ∑ i = 1 p i t ∂ F ∂ t = ∑ i = 1 N 2 ( R ⋅ p s i + t − p t i ) = 2 n t + 2 R ∑ i = 1 N p s i − 2 ∑ i = 1 N p t i

令导数为 0 ，则有：

$t = 1 N \sum i = 1 N p t i - R 1 N \sum i = 1 N p s i = p ¯ t - R p ¯ s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> t = (adsbygoogle = window.adsbygoogle || []).push({}); 1 N N \sum i = 1 p i t - R 1 N N \sum (adsbygoogle = window.adsbygoogle || []).push({}); i = 1 p i s = ¯ p t - R ¯ p s t = 1 N \sum i = 1 N p t i - R 1 N \sum i = 1 N p s i = p ¯ t - R p ¯ s (adsbygoogle = window.adsbygoogle || []).push({});$

无论 R 取值如何，根据上式我们都可以求得最优的 t，使得 loss 最小。下面我们来推导最优旋转的计算公式。

2.2.2 计算最优旋转

经过最优平移的推导，我们知道无论旋转如何取值，都可以通过计算点云的质心来得到最优平移，为了下面计算上的简便，我们不妨不考虑平移的影响，先将源点云和目标点云都转换到质心坐标下，这也就是之前令 $p^s i = p s i - p ¯ s " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; ">^p i s = p i s - ¯ p s p^s i = p s i - p ¯ s$ ， $p ^ t i = p t i − p ¯ t " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ^ (adsbygoogle = window.adsbygoogle || []).push({}); p i t = p i t − ¯ p t p ^ t i = p t i − p ¯ t$ 的意义。

下面我们用 $p ^ s i " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ^ p i s p ^ s i (adsbygoogle = window.adsbygoogle || []).push({});$ 和 $p^t i " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; ">^p i t p^t i$ 进行推导。

不考虑平移，则 loss 可以写成：

$F ( R ) = ∑ i = 1 N | | R ⋅ p ^ s i − p ^ t i | | 2 " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> F ( R ) = N ∑ i = 1 | (adsbygoogle = window.adsbygoogle || []).push({}); | R ⋅ ^ p i s − ^ p i t | | 2 F ( R ) = ∑ i = 1 N | | R ⋅ p ^ s i − p ^ t i | | 2$

先化简 $| | R ⋅ p ^ s i − p ^ t i | | 2 " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> | (adsbygoogle = window.adsbygoogle || []).push({}); | R ⋅ ^ p i s − ^ p i t | | 2 | | R ⋅ p ^ s i − p ^ t i | | 2$ ：

| | R ⋅ ^ p i s − ^ p i t | | 2 = ( R ⋅ ^ p i s − ^ p i t ) T ( R ⋅ ^ p i s − ^ p i t ) = ( ^ p i s T R T − ^ p i t T ) ( R ⋅ ^ p i s − ^ p i t ) = ^ p i s T R T R ^ p i s − ^ p i t T R ^ p i s − ^ p i s T R T ^ p i t + ^ p i t T ^ p i t = | | ^ p i s | | 2 + | | ^ p i t | | 2 − ^ p i t T R ^ p i s − ^ p i s T R T ^ p i t = | | ^ p i s | | 2 + | | ^ p i t | | 2 − 2 ^ p i t T R ^ p i s | | R ⋅ p ^ s i − p ^ t i | | 2 = ( R ⋅ p ^ s i − p ^ t i ) T ( R ⋅ p ^ s i − p ^ t i ) = ( p ^ s i T R T − p ^ t i T ) ( R ⋅ p ^ s i − p ^ t i ) = p ^ s i T R T R p ^ s i − p ^ t i T R p ^ s i − p ^ s i T R T p ^ t i + p ^ t i T p ^ t i = | | p ^ s i | | 2 + | | p ^ t i | | 2 − p ^ t i T R p ^ s i − p ^ s i T R T p ^ t i = | | p ^ s i | | 2 + | | p ^ t i | | 2 − 2 p ^ t i T R p ^ s i

这里利用到了 $R T R = I " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R T R = I R T R = I$ 和 $p ^ t i T R p ^ s i = p ^ s i T R T p ^ t i " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ^ (adsbygoogle = window.adsbygoogle || []).push({}); p i t T R ^ p i s = ^ p i s T R (adsbygoogle = window.adsbygoogle || []).push({}); T ^ p i t p ^ t i T R p ^ s i = p ^ s i T R T p ^ t i$ （标量的转置等于自身）的性质。

由于点的坐标是确定的(和 R 无关)，所以最小化原 loss 等价于求：

$R ∗ = arg ⁡ min R ⁡ ( − 2 ∑ i = 1 N p ^ t i T R p ^ s i ) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R ∗ = arg min R (adsbygoogle = window.adsbygoogle || []).push({}); ( − 2 N ∑ i = 1 ^ p i t T R ^ p i s ) (adsbygoogle = window.adsbygoogle || []).push({}); R ∗ = arg ⁡ min R ⁡ ( − 2 ∑ i = 1 N p ^ t i T R p ^ s i )$

也即为求：

$R ∗ = arg ⁡ max R ⁡ ( ∑ i = 1 N p ^ t i T R p ^ s i ) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R ∗ = arg max R ( N ∑ i = 1 (adsbygoogle = window.adsbygoogle || []).push({}); ^ p i t T R ^ p i s ) R ∗ = arg ⁡ max R ⁡ ( ∑ i = 1 N p ^ t i T R p ^ s i )$

注意到 $∑ i = 1 N p ^ t i T R p ^ s i = t r a c e ( P t T R P s ) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> ∑ N (adsbygoogle = window.adsbygoogle || []).push({}); i = 1 ^ p i t T R ^ p i s = t r a c e ( P (adsbygoogle = window.adsbygoogle || []).push({}); T t R P s ) ∑ i = 1 N p ^ t i T R p ^ s i = t r a c e ( P t T R P s )$ （由矩阵乘法及 trace 的定义可得）

则问题转化为：

$R * = arg max R t r a c e (P t T R P s) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> R * = arg max R t r a (adsbygoogle = window.adsbygoogle || []).push({}); c e (P T t R P s) R * = arg max R t r a c e (P t T R P s)$

根据 trace 的性质 $t r a c e (A B) = t r a c e (B A) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> t r a c e (A B) = t r a c e ((adsbygoogle = window.adsbygoogle || []).push({}); B A) t r a c e (A B) = t r a c e (B A)$ ，（这里不要求 A, B 为方阵，只要 A*B 是方阵即可），有：

$t r a c e (P t T R P s) = t r a c e (R P s P t T) " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> t r a c e (P T t R P s) = t r a c e (R (adsbygoogle = window.adsbygoogle || []).push({}); P s P T t) t r a c e (P t T R P s) = t r a c e (R P s P t T)$

还记得前面定义的矩阵 H 和其 SVD 分解吗？带入上式得到：

t r a c e ( P T t R P s ) = t r a c e ( R P s P T t ) = t r a c e ( R H ) = t r a c e ( R U Σ V T ) = t r a c e ( Σ V T R U ) t r a c e ( P t T R P s ) = t r a c e ( R P s P t T ) = t r a c e ( R H ) = t r a c e ( R U Σ V T ) = t r a c e ( Σ V T R U )

注意这里 $V, U, R " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> V, U, R V, U, R$ 都是正交矩阵（orthogonal matrices），所以 $V T R U " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> V T R U V T R U$ 也是正交矩阵。令 $M = V T R U = [m 11 m 12 m 13 m 21 m 22 m 23 m 31 m 32 m 33] " role="presentation" style=" padding-top: 1px; padding-bottom: 1px; border-width: 0px; border-style: initial; border-color: initial; font-size: 17.08px; vertical-align: baseline; display: inline-block; line-height: 0; font-size-adjust: none; overflow-wrap: normal; word-spacing: normal; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; "> M = V T R U = ⎡ ⎢ ⎣ m 11 m 12 m$