当前位置: X-MOL 学术Computing › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
RNe2Vec: information diffusion popularity prediction based on repost network embedding
Computing ( IF 3.7 ) Pub Date : 2020-10-30 , DOI: 10.1007/s00607-020-00858-x
Jiaxing Shang , Shuo Huang , Dingyang Zhang , Zixuan Peng , Dajiang Liu , Yong Li , Lexi Xu

With the rapid development in artificial intelligence and mobile networks, the past decade has witnessed the flourish of social media, and information diffusion popularity prediction in social media has attracted wide attention in both academics and industrials. However, existing popularity prediction methods either rely heavily on human experience to handcraft the features, designate the generative model, or largely depend on the underlying user relation network for embedding learning. Motivated by the above observation, this paper studies the precise prediction of the information diffusion popularity only based on early repost information, given that the underlying user relation network is unknown. To solve this problem, we propose RNe2Vec (repost network to vector), a repost network embedding-based diffusion popularity prediction algorithm. Specifically, we first build a repost network from the early repost data, and then use biased random walks to generate node sequences, in which we elaborately design walking rules to capture different repost behaviors. After that we employ the skip-gram method to learn low dimensional node vectors from the node sequences. Finally, we apply PCA (principal component analysis) algorithm on the node vectors for dimensionality reduction, and combine the embedding features with handcrafted features to train the downstream machine learning models. Experimental results on a microblog dataset show that incorporating network embedding features can significantly improve the overall prediction accuracy.

中文翻译:

RNe2Vec:基于转发网络嵌入的信息扩散流行度预测

随着人工智能和移动网络的飞速发展,过去十年见证了社交媒体的蓬勃发展,社交媒体中的信息传播流行度预测引起了学术界和工业界的广泛关注。然而,现有的流行度预测方法要么严重依赖人类经验来手工制作特征、指定生成模型,要么很大程度上依赖于底层用户关系网络进行嵌入学习。受上述观察启发,本文研究了在底层用户关系网络未知的情况下,仅基于早期转发信息对信息扩散流行度的精确预测。为了解决这个问题,我们提出了 RNe2Vec(repost network to vector),一种基于 repost 网络嵌入的扩散流行度预测算法。具体来说,我们首先从早期的转发数据构建转发网络,然后使用有偏随机游走生成节点序列,其中我们精心设计了行走规则来捕获不同的转发行为。之后我们使用skip-gram方法从节点序列中学习低维节点向量。最后,我们在节点向量上应用 PCA(主成分分析)算法进行降维,并将嵌入特征与手工特征相结合来训练下游机器学习模型。在微博数据集上的实验结果表明,结合网络嵌入特征可以显着提高整体预测精度。其中我们精心设计了行走规则来捕捉不同的转发行为。之后我们使用skip-gram方法从节点序列中学习低维节点向量。最后,我们在节点向量上应用 PCA(主成分分析)算法进行降维,并将嵌入特征与手工特征相结合来训练下游机器学习模型。在微博数据集上的实验结果表明,结合网络嵌入特征可以显着提高整体预测精度。其中我们精心设计了行走规则来捕捉不同的转发行为。之后我们使用skip-gram方法从节点序列中学习低维节点向量。最后,我们在节点向量上应用 PCA(主成分分析)算法进行降维,并将嵌入特征与手工特征相结合来训练下游机器学习模型。在微博数据集上的实验结果表明,结合网络嵌入特征可以显着提高整体预测精度。并将嵌入特征与手工特征结合起来训练下游机器学习模型。在微博数据集上的实验结果表明,结合网络嵌入特征可以显着提高整体预测精度。并将嵌入特征与手工特征结合起来训练下游机器学习模型。在微博数据集上的实验结果表明,结合网络嵌入特征可以显着提高整体预测精度。
更新日期:2020-10-30
down
wechat
bug