AOT: Pushing the Efficiency Boundary of Main-memory Triangle Listing,arXiv - CS - Databases

当前位置： X-MOL 学术 › arXiv.cs.DB › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

AOT: Pushing the Efficiency Boundary of Main-memory Triangle Listing
arXiv - CS - Databases Pub Date : 2020-06-20 , DOI: arxiv-2006.11494
Michael Yu, Lu Qin, Ying Zhang, Wenjie Zhang, Xuemin Lin

Triangle listing is an important topic significant in many practical applications. Efficient algorithms exist for the task of triangle listing. Recent algorithms leverage an orientation framework, which can be thought of as mapping an undirected graph to a directed acylic graph, namely oriented graph, with respect to any global vertex order. In this paper, we propose an adaptive orientation technique that satisfies the orientation technique but refines it by traversing carefully based on the out-degree of the vertices in the oriented graph during the computation of triangles. Based on this adaptive orientation technique, we design a new algorithm, namely aot, to enhance the edge-iterator listing paradigm. We also make improvements to the performance of aot by exploiting the local order within the adjacent list of the vertices. We show that aot is the first work which can achieve best performance in terms of both practical performance and theoretical time complexity. Our comprehensive experiments over $16$ real-life large graphs show a superior performance of our \aot algorithm when compared against the state-of-the-art, especially for massive graphs with billions of edges. Theoretically, we show that our proposed algorithm has a time complexity of $\Theta(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^{+}(v)\}))$, where $\vec{E}$ and $deg^{+}(x)$ denote the set of directed edges in an oriented graph and the out-degree of vertex $x$ respectively. As to our best knowledge, this is the best time complexity among in-memory triangle listing algorithms.

中文翻译：

AOT：推动内存三角上市的效率边界

三角形列表是许多实际应用中的重要课题。三角形列表任务存在有效的算法。最近的算法利用了一个方向框架，可以将其视为将无向图映射到有向无向图，即相对于任何全局顶点顺序的有向图。在本文中，我们提出了一种自适应定向技术，该技术满足定向技术，但通过在三角形计算期间基于定向图中顶点的出度仔细遍历来对其进行细化。基于这种自适应定向技术，我们设计了一种新算法，即 aot，以增强边缘迭代器列表范式。我们还通过利用相邻顶点列表中的局部顺序来改进 aot 的性能。我们表明 aot 是第一个在实际性能和理论时间复杂度方面都可以达到最佳性能的工作。我们对 16 美元的真实大型图的综合实验表明，与最先进的算法相比，我们的 \aot 算法具有优越的性能，尤其是对于具有数十亿条边的大规模图。从理论上讲，我们表明我们提出的算法的时间复杂度为 $\Theta(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^ {+}(v)\}))$，其中 $\vec{E}$ 和 $deg^{+}(x)$ 表示有向图中的有向边集和顶点 $x 的出度$ 分别。据我们所知，这是内存中三角形列表算法中最佳的时间复杂度。我们对 16 美元的真实大图的综合实验表明，与最先进的算法相比，我们的 \aot 算法具有优越的性能，尤其是对于具有数十亿条边的海量图。从理论上讲，我们表明我们提出的算法的时间复杂度为 $\Theta(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^ {+}(v)\}))$，其中 $\vec{E}$ 和 $deg^{+}(x)$ 表示有向图中的有向边集和顶点 $x 的出度$ 分别。据我们所知，这是内存中三角形列表算法中最佳的时间复杂度。我们对 16 美元的真实大图的综合实验表明，与最先进的算法相比，我们的 \aot 算法具有优越的性能，尤其是对于具有数十亿条边的海量图。从理论上讲，我们表明我们提出的算法的时间复杂度为 $\Theta(\sum_{ \langle u,v \rangle \in \vec{E} } \min\{ deg^{+}(u),deg^ {+}(v)\}))$，其中 $\vec{E}$ 和 $deg^{+}(x)$ 表示有向图中的有向边集和顶点 $x 的出度$ 分别。据我们所知，这是内存中三角形列表算法中最佳的时间复杂度。deg^{+}(v)\}))$，其中 $\vec{E}$ 和 $deg^{+}(x)$ 表示有向图中的有向边集和顶点的出度分别为 $x$。据我们所知，这是内存中三角形列表算法中最佳的时间复杂度。deg^{+}(v)\}))$，其中 $\vec{E}$ 和 $deg^{+}(x)$ 表示有向图中的有向边集和顶点的出度分别为 $x$。据我们所知，这是内存中三角形列表算法中最佳的时间复杂度。

更新日期：2020-06-26

点击分享查看原文

点击收藏

阅读更多本刊最新论文