当前位置: X-MOL 学术Appl. Netw. Sci. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Motif-based spectral clustering of weighted directed networks
Applied Network Science ( IF 1.3 ) Pub Date : 2020-09-10 , DOI: 10.1007/s41109-020-00293-z
William G. Underwood , Andrew Elliott , Mihai Cucuringu

Clustering is an essential technique for network analysis, with applications in a diverse range of fields. Although spectral clustering is a popular and effective method, it fails to consider higher-order structure and can perform poorly on directed networks. One approach is to capture and cluster higher-order structures using motif adjacency matrices. However, current formulations fail to take edge weights into account, and thus are somewhat limited when weight is a key component of the network under study.We address these shortcomings by exploring motif-based weighted spectral clustering methods. We present new and computationally useful matrix formulae for motif adjacency matrices on weighted networks, which can be used to construct efficient algorithms for any anchored or non-anchored motif on three nodes. In a very sparse regime, our proposed method can handle graphs with a million nodes and tens of millions of edges. We further use our framework to construct a motif-based approach for clustering bipartite networks.We provide comprehensive experimental results, demonstrating (i) the scalability of our approach, (ii) advantages of higher-order clustering on synthetic examples, and (iii) the effectiveness of our techniques on a variety of real world data sets; and compare against several techniques from the literature. We conclude that motif-based spectral clustering is a valuable tool for analysis of directed and bipartite weighted networks, which is also scalable and easy to implement.

中文翻译:

基于主题的加权有向网络频谱聚类

群集是网络分析的一项必不可少的技术,其应用范围广泛。尽管频谱聚类是一种流行且有效的方法,但它无法考虑高阶结构,并且在定向网络上的性能较差。一种方法是使用基序邻接矩阵捕获和聚类高阶结构。但是,当前的公式未能考虑边缘权重,因此当权重是所研究网络的关键组成部分时,其局限性受到一定限制。我们通过探索基于基序的加权谱聚类方法来解决这些缺点。我们为加权网络上的主题邻接矩阵提供了新的且对计算有用的矩阵公式,可用于为三个节点上的任何锚定或非锚定主题构建有效的算法。在一个非常稀疏的政权中 我们提出的方法可以处理具有一百万个节点和数千万条边的图。我们进一步使用我们的框架来构建基于主题的二聚网络聚类方法。我们提供了全面的实验结果,展示了(i)该方法的可扩展性,(ii)在合成示例上进行高阶聚类的优势,以及(iii)我们的技术在各种现实世界数据集上的有效性;并与文献中的几种技术进行比较。我们得出结论,基于基序的频谱聚类是用于分析有向和双向加权网络的有价值的工具,它也是可扩展的并且易于实现。我们提供了全面的实验结果,展示了(i)我们方法的可扩展性,(ii)在合成示例上进行高阶聚类的优势,以及(iii)我们的技术在各种现实世界数据集上的有效性;并与文献中的几种技术进行比较。我们得出结论,基于基序的频谱聚类是用于分析有向和双向加权网络的有价值的工具,它也是可扩展的并且易于实现。我们提供了综合的实验结果,展示了(i)我们方法的可扩展性,(ii)在合成示例上进行高阶聚类的优势,以及(iii)我们的技术在各种现实世界数据集上的有效性;并与文献中的几种技术进行比较。我们得出结论,基于基序的频谱聚类是用于分析有向和双向加权网络的有价值的工具,它也是可扩展的并且易于实现。
更新日期:2020-09-10
down
wechat
bug