Abstract
Cross-network anchor link discovery is an important research problem and has many applications in heterogeneous social network. Existing schemes of cross-network anchor link discovery can provide reasonable link discovery results, but the quality of these results depends on the features of the platform. Therefore, there is no theoretical guarantee to the stability. This article employs user embedding feature to model the relationship between cross-platform accounts, that is, the more similar the user embedding features are, the more similar the two accounts are. The similarity of user embedding features is determined by the distance of the user features in the latent space. Based on the user embedding features, this article proposes an embedding representation-based method Con&Net(Content and Network) to solve cross-network anchor link discovery problem. Con&Net combines the user’s profile features, user-generated content (UGC) features, and user’s social structure features to measure the similarity of two user accounts. Con&Net first trains the user’s profile features to get profile embedding. Then it trains the network structure of the nodes to get structure embedding. It connects the two features through vector concatenating, and calculates the cosine similarity of the vector based on the embedding vector. This cosine similarity is used to measure the similarity of the user accounts. Finally, Con&Net predicts the link based on similarity for account pairs across the two networks. A large number of experiments in Sina Weibo and Twitter networks show that the proposed method Con&Net is better than state-of-the-art method. The area under the curve (AUC) value of the receiver operating characteristic (ROC) curve predicted by the anchor link is 11% higher than the baseline method, and Precision@30 is 25% higher than the baseline method.
- Marco Balduzzi, Christian Platzer, Thorsten Holz, Engin Kirda, Davide Balzarotti, and Christopher Kruegel. 2010. Abusing social networks for automated user profiling. In Proceedings of the International Workshop on Recent Advances in Intrusion Detection. Jha S., Sommer R., Kreibich C. (Eds.), Lecture Notes in Computer Science, Vol. 6307, Springer, 422–441.Google Scholar
- Sergey Bartunov, Anton Korshunov, Seung-Taek Park, Wonho Ryu, and Hyungdong Lee. 2012. Joint link-attribute user identity resolution in online social networks. In Proceedings of the 6th International Conference on Knowledge Discovery and Data Mining, Workshop on Social Network Mining and Analysis. ACM.Google Scholar
- Sonja Buchegger, Doris Schiöberg, Le-Hung Vu, and Anwitaman Datta. 2009. PeerSoN: P2P social networking: Early experiences and insights. In Proceedings of the 2nd ACM EuroSys Workshop on Social Network Systems. ACM, 46–52.Google ScholarDigital Library
- Rhonda Chaytor, Edward Brown, and Todd Wareham. 2006. Privacy advisors for personal information management. In Proceedings of the SIGIR Workshop on Personal Information Management. 28–31.Google Scholar
- Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2010. You are where you tweet: A content-based approach to geo-locating twitter users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. ACM, 759–768.Google ScholarDigital Library
- Xiaokai Chu, Xinxin Fan, Di Yao, Zhihua Zhu, Jianhui Huang, and Jingping Bi. 2019. Cross-network embedding for multi-network alignment. In Proceedings of the 2019 World Wide Web Conference. 273–284.Google ScholarDigital Library
- Yi Cui, Jian Pei, Guanting Tang, Wo-Shun Luk, Daxin Jiang, and Ming Hua. 2013. Finding e-mail correspondents in online social networks. World Wide Web 16, 2 (2013), 195–218.Google ScholarDigital Library
- Daniel M. Dunlavy, Tamara G. Kolda, and Evrim Acar. 2011. Temporal link prediction using matrix and tensor factorizations. ACM Transactions on Knowledge Discovery from Data 5, 2 (2011), 10.Google Scholar
- Shuo Feng, Qian Wang, Derong Shen, Yue Kou, Tiezheng Nie, and Ge Yu. 2017. User identification across social networks based on global view features. In Proceedings of the 2017 14th Web Information Systems and Applications Conference. IEEE, 93–98.Google ScholarCross Ref
- Neil Zhenqiang Gong, Ameet Talwalkar, Lester Mackey, Ling Huang, Eui Chul Richard Shin, Emil Stefanov, Elaine Runting Shi, and Dawn Song. 2014. Joint link prediction and attribute inference using a social-attribute network. ACM Transactions on Intelligent Systems and Technology 5, 2 (2014), 27.Google ScholarDigital Library
- Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 855–864.Google ScholarDigital Library
- Desislava Hristova, Anastasios Noulas, Chloë Brown, Mirco Musolesi, and Cecilia Mascolo. 2016. A multilayer approach to multiplexity and link prediction in online geo-social networks. EPJ Data Science 5, 1 (2016), 24.Google ScholarCross Ref
- Shouling Ji, Weiqing Li, Neil Zhenqiang Gong, Prateek Mittal, and Raheem Beyah. 2016. Seed-based de-anonymizability quantification of social networks. IEEE Transactions on Information Forensics and Security 11, 7 (2016), 1398–1411.Google ScholarDigital Library
- Xiangnan Kong, Jiawei Zhang, and Philip S. Yu. 2013. Inferring anchor links across multiple heterogeneous social networks. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management. ACM, 179–188.Google Scholar
- Nitish Korula and Silvio Lattanzi. 2014. An efficient reconciliation algorithm for social networks. Proceedings of the VLDB Endowment 7, 5 (2014), 377–388.Google ScholarDigital Library
- Danai Koutra, Hanghang Tong, and David Lubensky. 2013. Big-align: Fast bipartite graph alignment. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining. IEEE, 389–398.Google ScholarCross Ref
- Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on International Conference on Machine Learning. 1188–1196.Google ScholarDigital Library
- X. Li, Y. Shang, Y. Cao, Y. Li, and Y. Liu. 2020. Type-aware anchor link prediction across heterogeneous networks based on graph attention network. Proceedings of the AAAI Conference on Artificial Intelligence 34, 1 (2020), 147–155.Google Scholar
- Jiongqian Liang, Deepak Ajwani, Patrick K. Nicholson, Alessandra Sala, and Srinivasan Parthasarathy. 2016. What links alice and bob?: Matching and ranking semantic patterns in heterogeneous networks. In Proceedings of the 25th International Conference on World Wide Web. 879–889.Google ScholarDigital Library
- Jing Liu, Fan Zhang, Xinying Song, Young-In Song, Chin-Yew Lin, and Hsiao-Wuen Hon. 2013. What’s in a name?: An unsupervised approach to link users across communities. In Proceedings of the 6th ACM International Conference on Web Search and Data Mining. ACM, 495–504.Google ScholarDigital Library
- L. Liu, X. Li, W. K. Cheung, and L. Liao. 2020. Structural representation learning for user alignment across social networks. IEEE Transactions on Knowledge and Data Engineering 32, 9 (2020), 1824–1837.Google ScholarDigital Library
- Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, and Ramayya Krishnan. 2014. Hydra: Large-scale social identity linkage via heterogeneous behavior modeling. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. ACM, 51–62.Google ScholarDigital Library
- Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 8 (2013), 1798–1828.Google ScholarDigital Library
- Arvind Narayanan and Vitaly Shmatikov. 2009. De-anonymizing social networks. In Proceedings of the 2009 30th IEEE Symposium on Security and Privacy. IEEE, 173–187.Google ScholarDigital Library
- A. Narayanan and V. Shmatikov. 2009. De-anonymizing social networks. In Proceedings of the 2009 30th IEEE Symposium on Security and Privacy. 173–187.Google Scholar
- Daniele Perito, Claude Castelluccia, Mohamed Ali Kaafar, and Pere Manils. 2011. How unique and traceable are usernames? In Proceedings of the International Symposium on Privacy Enhancing Technologies Symposium. Fischer-Hbner S. and Hopper N. (Eds.), Lecture Notes in Computer Science, Vol. 6794. Springer, 1–17.Google Scholar
- Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 701–710.Google ScholarDigital Library
- Yizhou Sun, Rick Barber, Manish Gupta, Charu C. Aggarwal, and Jiawei Han. 2011. Co-author relationship prediction in heterogeneous bibliographic networks. In Proceedings of the 2011 International Conference on Advances in Social Networks Analysis and Mining. IEEE, 121–128.Google ScholarDigital Library
- Shulong Tan, Ziyu Guan, Deng Cai, Xuzhen Qin, Jiajun Bu, and Chun Chen. 2014. Mapping users across networks by manifold alignment on hypergraph. In Proceedings of the 28th AAAI Conference on Artificial Intelligence. 159–165.Google Scholar
- Shinji Umeyama. 1988. An eigendecomposition approach to weighted graph matching problems. IEEE Transactions on Pattern Analysis and Machine Intelligence 10, 5 (1988), 695–703.Google ScholarDigital Library
- Peng Wang, BaoWen Xu, YuRong Wu, and XiaoYu Zhou. 2015. Link prediction in social networks: The state-of-the-art. Science China Information Sciences 58, 1 (2015), 1–38.Google ScholarCross Ref
- S. K. Wang, X. T. Li, Y. M. Ye, S. S. Feng, R. Y. K. Lau, X. H. Huang, and X. L. Du. 2019. Anchor link prediction across attributed networks via network embedding. Entropy 21, 3 (2019), 13. Google Scholar
- Rui Ye, Xin Li, Yujie Fang, Hongyu Zang, and Mingzhong Wang. 2019. A vectorized relational graph convolutional network for multi-relational network alignment. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 4135–4141.Google ScholarCross Ref
- Reza Zafarani and Huan Liu. 2009. Connecting corresponding identities across communities. In Proceedings of the 3rd International AAAI Conference on Weblogs and Social Media. 354–357.Google Scholar
- Reza Zafarani and Huan Liu. 2013. Connecting users across social media sites: A behavioral-modeling approach. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 41–49.Google ScholarDigital Library
- Jiawei Zhang, Xiangnan Kong, and Philip S. Yu. 2014. Transferring heterogeneous links across location-based social networks. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining. ACM, 303–312.Google Scholar
- Jiawei Zhang and S. Yu Philip. 2015. Multiple anonymized social networks alignment. In Proceedings of the 2015 IEEE International Conference on Data Mining. IEEE, 599–608.Google Scholar
- Jiawei Zhang, Weixiang Shao, Senzhang Wang, Xiangnan Kong, and S. Yu Philip. 2015. Partial network alignment with generic stable matching. In Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration. IEEE, 166–173.Google Scholar
- Jiawei Zhang and Philip S. Yu. 2015. Integrated anchor and social link predictions across social networks. In Proceedings of the 24th International Conference on Artificial Intelligence. 2125–2131.Google Scholar
- Yutao Zhang, Jie Tang, Zhilin Yang, Jian Pei, and Philip S. Yu. 2015. Cosnet: Connecting heterogeneous social networks with local and global consistency. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1485–1494.Google Scholar
- Junxing Zhu, Jiawei Zhang, Quanyuan Wu, Yan Jia, Bin Zhou, Xiaokai Wei, and Philip Yu. 2017. Constrained active learning for anchor link prediction across multiple heterogeneous social networks. Sensors 17, 8 (2017), 1786.Google ScholarCross Ref
Index Terms
- Con&Net: A Cross-Network Anchor Link Discovery Method Based on Embedding Representation
Recommendations
A novel cross-network node pair embedding methodology for anchor link prediction
AbstractAnchor link prediction across social networks is highly important for multiple social network analysis. Traditional methods rely heavily on user-generated information or the quality of network topology information and are not suitable for real-...
Network embedding based link prediction in dynamic networks
AbstractLink prediction is a fundamental task in network theory due to the wide variety of applications in different domains. The objective of link prediction is to find the future links that are likely to be seen in some future time. In this ...
Highlights- We propose a novel edge embedding-based method for link prediction.
- Our ...
Link Prediction Based on Smooth Evolution of Network Embedding
Web Information Systems and ApplicationsAbstractThe problem of link prediction in dynamic heterogeneous information networks has been widely studied in recent years. The technique of network embedding has been proved extremely useful for link prediction. However, the existing methods lack the ...
Comments