Skip to main content
Log in

ColluEagle: collusive review spammer detection using Markov random fields

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Product reviews are extremely valuable for online shoppers in providing purchase decisions. Driven by immense profit incentives, fraudsters deliberately fabricate untruthful reviews to distort the reputation of online products. As online reviews become more and more important, group spamming, i.e., a team of fraudsters working collaboratively to attack a set of target products, becomes a new fashion. Previous works use review network effects, i.e. the relationships among reviewers, reviews, and products, to detect fake reviews or review spammers, but ignore time effects, which are critical in characterizing group spamming. In this paper, we propose a novel Markov random field (MRF)-based method (ColluEagle) to detect collusive review spammers, as well as review spam campaigns, considering both network effects and time effects. First we identify co-review pairs, a review phenomenon that happens between two reviewers who review a common product in a similar way, and then model reviewers and their co-review pairs as a pairwise-MRF, and use loopy belief propagation to evaluate the suspiciousness of reviewers. We further design a high quality yet easy-to-compute node prior for ColluEagle, through which the review spammer groups can also be subsequently identified. Experiments show that ColluEagle can not only detect collusive spammers with high precision, significantly outperforming state-of-the-art baselines—FraudEagle and SpEagle, but also identify highly suspicious review spammer campaigns.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. https://github.com/zhuowangsylu/ColluEagle.

References

  • Akoglu L, Chandy R, Faloutsos C (2013) Opinion fraud detection in online reviews by network effects. In: Proceedings of the seventh international conference on weblogs and social media, ICWSM 2013, Cambridge, Massachusetts, USA, July 8–11

  • Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Al Najada H (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):23. https://doi.org/10.1186/s40537-015-0029-9

    Article  Google Scholar 

  • Fei G, Mukherjee A, Liu B, Hsu M, Castellanos M, Ghosh R (2013) Exploiting burstiness in reviews for review spammer detection. In: Seventh international AAAI conference on weblogs and social media

  • Jindal N, Liu B (2008) Opinion spam and analysis. In: Proceedings of the 2008 international conference on web search and data mining. ACM, New York, pp 219–230

  • Li J, Cardie C, Li S (2013) Topicspam: a topic-model based approach for spam detection. In: Proceedings of the 51st annual meeting of the association for computational linguistics, ACL 2013, 4–9 August 2013, Sofia, Bulgaria, vol 2, short papers, pp 217–221

  • Li H, Mukherjee A, Liu B, Kornfield R, Emery S (2014) Detecting campaign promoters on twitter using Markov random fields. In: 2014 IEEE international conference on data mining, ICDM 2014, Shenzhen, China, December 14–17, 2014, pp 290–299. https://doi.org/10.1109/ICDM.2014.59

  • Lim EP, Nguyen VA, Jindal N, Liu B, Lauw HW (2010) Detecting product review spammers using rating behaviors. In: Proceedings of the 19th ACM international conference on information and knowledge management, CIKM ’10. New York, NY, USA, pp 939–948

  • Mukherjee A, Liu B, Glance N (2012) Spotting fake reviewer groups in consumer reviews. In: Proceedings of the 21st international conference on world wide web. ACM, New York, pp 191–200

  • Mukherjee A, Venkataraman V, Liu B, Glance NS (2013) What yelp fake review filter might be doing? In: Proceedings of the seventh international conference on weblogs and social media, ICWSM 2013, Cambridge, Massachusetts, USA, July 8–11

  • Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1, Stroudsburg, PA, USA, pp 309–319

  • Rastogi A, Mehrotra M (2017) Opinion spam detection in online reviews. JIKM 16(4):1–38

    Google Scholar 

  • Rayana S, Akoglu L (2015) Collective opinion spam detection: bridging review networks and metadata. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, Sydney, NSW, Australia, August 10–13, 2015, pp 985–994

  • Viviani M, Pasi G (2017) Credibility in social media: opinions, news, and health information—a survey. Wiley Interdiscip Rev Data Min Knowl Discov 7(5):e1209

    Article  Google Scholar 

  • Wang G, Xie S, Liu B, Yu PS (2011) Review graph based online store review spammer detection. In: 11th IEEE international conference on data mining, ICDM 2011, Vancouver, BC, Canada, December 11–14, 2011, pp 1242–1247

  • Wang Z, Hou T, Song D, Li Z, Kong T (2016) Detecting review spammer groups via bipartite graph projection. Comput J 59(6):861–874. https://doi.org/10.1093/comjnl/bxv068

    Article  Google Scholar 

  • Wang Z, Gu S, Xu X (2018a) GSLDA: LDA-based group spamming detection in product reviews. Appl Intell 48(9):3094–3107

    Article  Google Scholar 

  • Wang Z, Gu S, Zhao X, Xu X (2018b) Graph-based review spammer group detection. Knowl Inf Syst 55(3):571–597. https://doi.org/10.1007/s10115-017-1068-7

    Article  Google Scholar 

  • Xie S, Wang G, Lin S, Yu PS (2012) Review spam detection via time series pattern discovery. In: Proceedings of the 21st international conference companion on World Wide Web, New York, NY, USA, pp 635–636

  • Xu X, Yuruk N, Feng Z, Schweiger TAJ (2007) SCAN: a structural clustering algorithm for networks. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, San Jose, California, USA, August 12–15, 2007, pp 824–833

  • Xu C, Zhang J (2015) Towards collusive fraud detection in online reviews. In: 2015 IEEE international conference on data mining, ICDM 2015, Atlantic City, NJ, USA, November 14–17, 2015, pp 1051–1056

  • Xu C, Zhang J, Chang K, Long C (2013) Uncovering collusive spammers in Chinese review websites. In: Proceedings of the 22nd ACM international conference on conference on information and knowledge management. ACM, New York, pp 979–988

  • Ye J, Akoglu L (2015) Discovering opinion spammer groups by network footprints. In: Appice A, Rodrigues P, Santos Costa V, Soares C, Gama J, Jorge A (eds) Machine learning and knowledge discovery in databases, vol 9284. Lecture notes in computer science. Springer, Berlin, pp 267–282

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhuo Wang.

Additional information

Responsible editor: G. Karypis.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Z., Hu, R., Chen, Q. et al. ColluEagle: collusive review spammer detection using Markov random fields. Data Min Knowl Disc 34, 1621–1641 (2020). https://doi.org/10.1007/s10618-020-00693-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-020-00693-w

Keywords

Navigation