On Approximating the Number of $k$-Cliques in Sublinear Time,SIAM Journal on Computing

当前位置： X-MOL 学术 › SIAM J. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

On Approximating the Number of $k$-Cliques in Sublinear Time
SIAM Journal on Computing ( IF 1.6 ) Pub Date : 2020-07-14 , DOI: 10.1137/18m1176701
Talya Eden , Dana Ron , C. Seshadhri

SIAM Journal on Computing, Volume 49, Issue 4, Page 747-771, January 2020.
We study the problem of approximating the number of $k$-cliques in a graph when given query access to the graph. We consider the standard query model for general graphs via (1) degree queries, (2) neighbor queries, and (3) pair queries. Let $n$ denote the number of vertices in the graph, $m$ the number of edges, and $C_k$ the number of $k$-cliques. We design an algorithm that outputs a $(1+\varepsilon)$-approximation (with high probability) for $C_k$, whose expected query complexity and running time are $O(\frac{n}{C_k^{1/k}}+\frac{m^{k/2}}{C_k} ){poly}(\log n, 1/\varepsilon,k)$. Hence, the complexity of the algorithm is sublinear in the size of the graph for $C_k = \omega(m^{k/2-1})$. Furthermore, we prove a lower bound showing that the query complexity of our algorithm is essentially optimal (up to the dependence on $\log n$, $1/\varepsilon$, and $k$). The previous results in this vein are by Feige [SIAM J. Comput., 35 (2006), pp. 964--984] and by Goldreich and Ron [Random Structures Algorithms, 32 (2008), pp. 473--493] for edge counting ($k=2$) and by Eden, Levi, Ron, and Seshadhri [SIAM J. Comput., 46 (2017), pp. 1603--1646] for triangle counting ($k=3$). Our result matches the complexities of these results. The previous result by Eden et al. hinges on a certain amortization technique that works only for triangle counting and does not generalize for larger cliques. We obtain a general algorithm that works for any $k\geq 3$ by designing a procedure that samples each $k$-clique incident to one of the vertices of a given set $S$ of vertices with approximately equal probability.

中文翻译：

关于近似亚线性时间的$ k $-团数

SIAM计算杂志，第49卷，第4期，第747-771页，2020年1月。
我们研究了在给定对图表的查询访问权时，近似计算图中$ k $ -clique数的问题。我们通过（1）度查询，（2）邻居查询和（3）对查询来考虑通用图的标准查询模型。令$ n $表示图中的顶点数，$ m $表示边的数量，$ C_k $表示$ k $斜点的数量。我们设计了一种算法，该算法为$ C_k $输出（高概率）$（1+ \ varepsilon）$近似值，其预期查询复杂度和运行时间为$ O（\ frac {n} {C_k ^ {1 / k }} + \ frac {m ^ {k / 2}} {C_k}）{poly}（\ log n，1 / \ varepsilon，k）$。因此，对于$ C_k = \ omega（m ^ {k / 2-1}）$，算法的复杂度在图的大小上是次线性的。此外，我们证明了下界，表明我们算法的查询复杂度实际上是最佳的（取决于对$ \ log n $，$ 1 / \ varepsilon $，和$ k $）。以前的结果是由Feige [SIAM J. Comput。，35（2006），第964--984页]和Goldreich和Ron [Random Structures Algorithms，32（2008），第473--493页]得出的。边缘计数（$ k = 2 $），由Eden，Levi，Ron和Seshadhri [SIAM J. Comput。，46（2017），第1603--1646页]进行三角形计数（$ k = 3 $）。我们的结果与这些结果的复杂性相符。Eden等人的先前结果。取决于某种摊销技术，该技术仅适用于三角计数，不适用于较大的集团。通过设计一个过程，以近似相等的概率对给定集合$ S $的一个顶点的每个$ k $ -clique事件进行采样，我们获得了适用于任何$ k \ geq 3 $的通用算法。964--984]和Goldreich和Ron [Random Structures Algorithms，32（2008），pp。473--493]用于边缘计数（$ k = 2 $），以及Eden，Levi，Ron和Seshadhri [SIAM J] [Comput。，46（2017），pp。1603--1646]进行三角形计数（$ k = 3 $）。我们的结果与这些结果的复杂性相符。Eden等人的先前结果。取决于某种摊销技术，该技术仅适用于三角计数，不适用于较大的集团。通过设计一个过程，以近似相等的概率采样到给定集合$ S $的一个顶点的每个$ k $ -clique事件，我们获得了适用于任何$ k \ geq 3 $的通用算法。964--984]和Goldreich和Ron [Random Structures Algorithms，32（2008），pp。473--493]用于边缘计数（$ k = 2 $），以及Eden，Levi，Ron和Seshadhri [SIAM J] [Comput。，46（2017），pp。1603--1646]进行三角形计数（$ k = 3 $）。我们的结果与这些结果的复杂性相符。Eden等人的先前结果。取决于某种摊销技术，该技术仅适用于三角计数，不适用于较大的集团。通过设计一个过程，以近似相等的概率采样到给定集合$ S $的一个顶点的每个$ k $ -clique事件，我们获得了适用于任何$ k \ geq 3 $的通用算法。1603--1646]用于三角形计数（$ k = 3 $）。我们的结果与这些结果的复杂性相符。Eden等人的先前结果。取决于某种摊销技术，该技术仅适用于三角计数，不适用于较大的集团。通过设计一个过程，以近似相等的概率采样到给定集合$ S $的一个顶点的每个$ k $ -clique事件，我们获得了适用于任何$ k \ geq 3 $的通用算法。1603--1646]用于三角形计数（$ k = 3 $）。我们的结果与这些结果的复杂性相符。Eden等人的先前结果。取决于某种摊销技术，该技术仅适用于三角计数，不适用于较大的集团。通过设计一个过程，以近似相等的概率对给定集合$ S $的一个顶点的每个$ k $ -clique事件进行采样，我们获得了适用于任何$ k \ geq 3 $的通用算法。

更新日期：2020-07-23

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>