Information Sciences ( IF 8.1 ) Pub Date : 2021-04-12 , DOI: 10.1016/j.ins.2021.04.027 Yizhang He , Kai Wang , Wenjie Zhang , Xuemin Lin , Ying Zhang
We propose a novel cohesive subgraph model called -strengthened -core (denoted as -core), which is the first to consider both tie strength and vertex engagement on bipartite graphs. An edge is a strong tie if contained in at least butterflies (-bicliques). -core requires each vertex on the upper or lower level to have at least or strong ties, given strength level . To retrieve the vertices of -core optimally, we construct index to store all -cores. Effective optimization techniques are proposed to improve index construction. To make our idea practical on large graphs, we propose 2D-indexes , and that selectively store the vertices of -core for some , and . The 2D-indexes are more space-efficient and require less construction time, each of which can support -core queries. As query efficiency depends on input parameters and the choice of 2D-index, we propose a learning-based hybrid computation paradigm by training a feed-forward neural network to predict the optimal choice of 2D-index that minimizes the query time. Extensive experiments show that (1) -core is an effective model capturing unique and important cohesive subgraphs; (2) the proposed techniques significantly improve the efficiency of index construction and query processing.
中文翻译:
在二部图中探索具有顶点接合和结合强度的内聚子图
我们提出了一种新的内聚子图模型,称为 加强 核心(表示为 -core),这是第一个在二部图上同时考虑连接强度和顶点接合的情况。如果至少包含一条边,则该边是一条牢固的领带 蝴蝶(-bicliques)。 -core要求上层或下层的每个顶点至少具有 或者 牢固的关系,给定强度水平 。检索的顶点最佳,我们构造索引 储存所有 -核心。提出了有效的优化技术来改善指标的构建。为了使我们的想法在大图上可行,我们提出了2D索引, 和 有选择地存储的顶点 核心一些 , 和 。2D索引更节省空间,并且需要更少的构建时间,每个索引都可以支持核心查询。由于查询效率取决于输入参数和2D索引的选择,因此我们通过训练前馈神经网络来预测2D索引的最佳选择以最小化查询时间,从而提出一种基于学习的混合计算范式。大量实验表明(1)-core是一种有效的模型,可捕获独特且重要的内聚子图;(2)所提出的技术显着提高了索引构建和查询处理的效率。