当前位置: X-MOL 学术J. Comput. Sci. Tech. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
DEM: Deep Entity Matching Across Heterogeneous Information Networks
Journal of Computer Science and Technology ( IF 1.9 ) Pub Date : 2020-07-01 , DOI: 10.1007/s11390-020-0139-5
Chao Kong , Bao-Xiang Chen , Li-Ping Zhang

Heterogeneous information networks, which consist of multi-typed vertices representing objects and multi-typed edges representing relations between objects, are ubiquitous in the real world. In this paper, we study the problem of entity matching for heterogeneous information networks based on distributed network embedding and multi-layer perceptron with a highway network, and we propose a new method named DEM short for Deep Entity Matching. In contrast to the traditional entity matching methods, DEM utilizes the multi-layer perceptron with a highway network to explore the hidden relations to improve the performance of matching. Importantly, we incorporate DEM with the network embedding methodology, enabling highly efficient computing in a vectorized manner. DEM’s generic modeling of both the network structure and the entity attributes enables it to model various heterogeneous information networks flexibly. To illustrate its functionality, we apply the DEM algorithm to two real-world entity matching applications: user linkage under the social network analysis scenario that predicts the same or matched users in different social platforms and record linkage that predicts the same or matched records in different citation networks. Extensive experiments on real-world datasets demonstrate DEM’s effectiveness and rationality.

中文翻译:

DEM:跨异构信息网络的深度实体匹配

由表示对象的多类型顶点和表示对象之间关系的多类型边组成的异构信息网络在现实世界中无处不在。在本文中,我们研究了基于分布式网络嵌入和多层感知器与高速公路网络的异构信息网络的实体匹配问题,并提出了一种称为深度实体匹配的简称 DEM 的新方法。与传统的实体匹配方法相比,DEM 利用多层感知器和高速公路网络来探索隐藏关系以提高匹配性能。重要的是,我们将 DEM 与网络嵌入方法相结合,以矢量化方式实现高效计算。DEM 对网络结构和实体属性的通用建模使其能够灵活地对各种异构信息网络进行建模。为了说明其功能,我们将 DEM 算法应用于两个现实世界的实体匹配应用:社交网络分析场景下的用户联动,预测不同社交平台中相同或匹配的用户,以及记录联动,预测不同社交平台中相同或匹配的记录。引文网络。对真实世界数据集的大量实验证明了 DEM 的有效性和合理性。社交网络分析场景下的用户联动,预测不同社交平台中相同或匹配的用户,记录联动预测不同引文网络中相同或匹配的记录。对真实世界数据集的大量实验证明了 DEM 的有效性和合理性。社交网络分析场景下的用户联动,预测不同社交平台中相同或匹配的用户,记录联动预测不同引文网络中相同或匹配的记录。对真实世界数据集的大量实验证明了 DEM 的有效性和合理性。
更新日期:2020-07-01
down
wechat
bug