Doubly Stochastic Subspace Clustering,arXiv - CS - Machine Learning

当前位置： X-MOL 学术 › arXiv.cs.LG › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Doubly Stochastic Subspace Clustering
arXiv - CS - Machine Learning Pub Date : 2020-11-30 , DOI: arxiv-2011.14859
Derek Lim, René Vidal, Benjamin D. Haeffele

Many state-of-the-art subspace clustering methods follow a two-step process by first constructing an affinity matrix between data points and then applying spectral clustering to this affinity. Most of the research into these methods focuses on the first step of generating the affinity matrix, which often exploits the self-expressive property of linear subspaces, with little consideration typically given to the spectral clustering step that produces the final clustering. Moreover, existing methods obtain the affinity by applying ad-hoc postprocessing steps to the self-expressive representation of the data, and this postprocessing can have a significant impact on the subsequent spectral clustering step. In this work, we propose to unify these two steps by jointly learning both a self-expressive representation of the data and an affinity matrix that is well-normalized for spectral clustering. In the proposed model, we constrain the affinity matrix to be doubly stochastic, which results in a principled method for affinity matrix normalization while also exploiting the known benefits of doubly stochastic normalization in spectral clustering. While our proposed model is non-convex, we give a convex relaxation that is provably equivalent in many regimes; we also develop an efficient approximation to the full model that works well in practice. Experiments show that our method achieves state-of-the-art subspace clustering performance on many common datasets in computer vision.

中文翻译：

双随机子空间聚类

许多最新的子空间聚类方法遵循两步过程，首先在数据点之间构造一个亲和力矩阵，然后将光谱聚类应用于此亲和力。这些方法的大多数研究都集中在生成亲和力矩阵的第一步，该步骤通常利用线性子空间的自表达特性，而通常很少考虑产生最终聚类的光谱聚类步骤。此外，现有方法通过将临时后处理步骤应用于数据的自表达表示来获得亲和力，并且这种后处理可能对后续的频谱聚类步骤产生重大影响。在这项工作中我们建议通过共同学习数据的自表示形式和针对光谱聚类良好规范化的亲和力矩阵来统一这两个步骤。在提出的模型中，我们将亲和矩阵约束为双随机的，这导致了一种原则上的亲和矩阵归一化的方法，同时还利用了频谱聚类中双随机归一化的已知好处。虽然我们提出的模型是非凸的，但是我们给出了在许多情况下等效的凸松弛。我们还为在实践中运行良好的完整模型开发了有效的近似值。实验表明，我们的方法在计算机视觉中的许多常见数据集上都实现了最新的子空间聚类性能。我们将亲和矩阵约束为双随机的，这导致了一种原则上的亲和矩阵归一化的方法，同时还利用了频谱聚类中双随机归一化的已知好处。虽然我们提出的模型是非凸的，但是我们给出了在许多情况下等效的凸松弛。我们还为在实践中运行良好的完整模型开发了有效的近似值。实验表明，我们的方法在计算机视觉中的许多常见数据集上都实现了最新的子空间聚类性能。我们将亲和矩阵约束为双随机的，这导致了一种原则上的亲和矩阵归一化的方法，同时还利用了频谱聚类中双随机归一化的已知好处。虽然我们提出的模型是非凸的，但是我们给出了在许多情况下等效的凸松弛。我们还为在实践中运行良好的完整模型开发了有效的近似值。实验表明，我们的方法在计算机视觉中的许多常见数据集上都实现了最新的子空间聚类性能。我们给出了在许多情况下等效的凸松弛。我们还为在实践中运行良好的完整模型开发了有效的近似值。实验表明，我们的方法在计算机视觉中的许多常见数据集上都实现了最新的子空间聚类性能。我们给出了在许多情况下等效的凸松弛。我们还为在实践中运行良好的完整模型开发了有效的近似值。实验表明，我们的方法在计算机视觉中的许多常见数据集上都实现了最新的子空间聚类性能。

更新日期：2020-12-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文