Approximation algorithms for 1-Wasserstein distance between persistence diagrams,arXiv - CS - Computational Geometry

当前位置： X-MOL 学术 › arXiv.cs.CG › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Approximation algorithms for 1-Wasserstein distance between persistence diagrams
arXiv - CS - Computational Geometry Pub Date : 2021-04-15 , DOI: arxiv-2104.07710
Samantha Chen, Yusu Wang

Recent years have witnessed a tremendous growth using topological summaries, especially the persistence diagrams (encoding the so-called persistent homology) for analyzing complex shapes. Intuitively, persistent homology maps a potentially complex input object (be it a graph, an image, or a point set and so on) to a unified type of feature summary, called the persistence diagrams. One can then carry out downstream data analysis tasks using such persistence diagram representations. A key problem is to compute the distance between two persistence diagrams efficiently. In particular, a persistence diagram is essentially a multiset of points in the plane, and one popular distance is the so-called 1-Wasserstein distance between persistence diagrams. In this paper, we present two algorithms to approximate the 1-Wasserstein distance for persistence diagrams in near-linear time. These algorithms primarily follow the same ideas as two existing algorithms to approximate optimal transport between two finite point-sets in Euclidean spaces via randomly shifted quadtrees. We show how these algorithms can be effectively adapted for the case of persistence diagrams. Our algorithms are much more efficient than previous exact and approximate algorithms, both in theory and in practice, and we demonstrate its efficiency via extensive experiments. They are conceptually simple and easy to implement, and the code is publicly available in github.

中文翻译：

余辉图之间的1-Wasserstein距离的近似算法

近年来，使用拓扑摘要（尤其是用于分析复杂形状的持久性图（编码所谓的持久性同源性））见证了巨大的增长。直观上，持久性同源性将潜在复杂的输入对象（它是图形，图像或点集等）映射到统一的特征摘要类型，称为持久性图。然后，可以使用这种持久性图表示来执行下游数据分析任务。一个关键问题是有效地计算两个余辉图之间的距离。尤其是，一个持久性图本质上是平面中多个点的集合，一个流行的距离是持久性图之间的所谓的1-Wasserstein距离。在本文中，我们提出了两种算法来近似近似线性时间内的余辉图的1-Wasserstein距离。这些算法主要遵循与两个现有算法相同的思想，以通过随机移位的四叉树来近似欧几里得空间中两个有限点集之间的最佳传输。我们展示了如何针对持久性图有效地调整这些算法。在理论上和实践上，我们的算法都比以前的精确算法和逼近算法有效得多，并且我们通过广泛的实验证明了它的效率。它们在概念上很简单且易于实现，并且代码可以在github上公开获得。这些算法主要遵循与两个现有算法相同的思想，以通过随机移位的四叉树来近似欧几里得空间中两个有限点集之间的最佳传输。我们展示了如何针对持久性图有效地调整这些算法。在理论上和实践上，我们的算法都比以前的精确算法和逼近算法有效得多，并且我们通过广泛的实验证明了它的效率。它们在概念上很简单且易于实现，并且代码可以在github上公开获得。这些算法主要遵循与两个现有算法相同的思想，以通过随机移位的四叉树来近似欧几里得空间中两个有限点集之间的最佳传输。我们展示了如何针对持久性图有效地调整这些算法。在理论上和实践上，我们的算法都比以前的精确算法和逼近算法有效得多，并且我们通过广泛的实验证明了它的效率。它们在概念上很简单且易于实现，并且代码可以在github上公开获得。并且我们通过广泛的实验证明了其效率。它们在概念上很简单且易于实现，并且代码可以在github上公开获得。并且我们通过广泛的实验证明了其效率。它们在概念上很简单且易于实现，并且代码可以在github上公开获得。

更新日期：2021-04-19

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>