Elsevier

Decision Support Systems

Volume 138, November 2020, 113362
Decision Support Systems

Antisocial online behavior detection using deep learning

https://doi.org/10.1016/j.dss.2020.113362Get rights and content

Highlights

  • Comprehensive benchmark on deep learning regimes for AOB detection.

  • Usage of transformer-based language models.

  • Interpretability module to understand the model's logic.

  • Interpretability module to detect unintended bias.

Abstract

Digitalization shifts human communication to online platforms, which has many benefits but also builds up a space for antisocial online behavior (AOB) such as harassment, insult and other forms of hateful textual content. Online platforms have good reasons to monitor and moderate such content. The paper examines the viability of automatic content monitoring using deep machine learning and natural language processing (NLP). More specifically, we consolidate prior work in the field of antisocial online behavior detection and compare relevant approaches to recent NLP models in an empirical study. Covering important methodological advancements in NLP including bidirectional encoding, attention, hierarchical text representations, and pre-trained transformer-based language models, and extending previous approaches by introducing a pseudo-sentence hierarchical attention network, the paper provides a comprehensive summary of the state-of-affairs in NLP-based AOB detection, clarifies the detection accuracy that is attainable with today's technology, discusses whether this degree is sufficient for deploying deep learning-based text screening systems, and approaches the interpretability topic.

Introduction

The shift of human communication to online platforms is a double-edged sword. Social benefits include the opportunity to share opinions and experiences, get immediate feedback, and the opportunity to discuss the hottest topics. From an economic perspective, the data from online communications enable business organization to learn from customer experiences, improve service offerings, and raise firm performance. Examples of corresponding advancements include Liu et al. [1], who propose a method to assess a product's competitive advantages based on social media. Similarly, Zhang et al. [2] use natural language processing (NLP) to analyze knowledge payment platforms and shed light on customer satisfaction, while Siering et al. [3] examine online reviews to identify what service aspects customers value the most. On the other hand, online communication platforms also create a space for malicious behavior such as the distribution of fake news and reviews, which distort insights gained from the data and may harm the reputation of the platform [4,5]. The focus of this work is related to a similar problem: the detection of antisocial behavior, such as insulting, harassment, or threatening in online communication.

Detection of such antisocial behavior is highly important for social welfare due to social, legislative, and financial reasons. According to the Cyberbullying Research Center's annual data in 2016, 33.8% of young people aged 12–17 in the US have experienced cyberbullying in their lifetime [6]. According to one German law, social media providers like Facebook, Google, Microsoft are obliged in Germany to remove hate speech posts within 24 h and report on their progress every six months [7]. Legal requirements, social norms, and codes of conduct emphasizes the importance for online platforms to identify antisocial online behavior (AOB), which we use as an umbrella term for any malicious behavior that can be found in the textual content on online communications platforms including insult, threat, personal attack, usage of harmful, rude or offensive language, cyberbullying and abuse.

Manual detection and monitoring of online content can be very costly, making autonomous systems for screening user-generated text content for traces of AOB a key attention point. Machine learning-based decision support systems that pre-screen transactions and flag suspicious cases for subsequent human inspection have proven effective in fraud detection [8] and may prove a viable solution for the AOB detection problem of social media platforms.

In the paper, we elaborate on the detection of AOB using NLP. Early academic research in the field was mostly concerned with the use of traditional machine learning methods (TML) such as logistic regressions, support vector machines, and decision trees [e.g., [9]], as well as lexicon-based approaches [e.g., [10]]. These methods heavily rely on extensive feature engineering, and their performance highly depends on the representation of the data. DL methods automate the procedure of feature engineering by learning the representations of the data through non-linear transformations. Such representations often achieve better performance than handcrafted features [11]. The main contribution of the paper is the following: we consolidate prior work on AOB detection and text classification and provide a comprehensive benchmark of alternative text processing regimes. We compare methods of TML with deep learning (DL) while covering significant methodological advancements, including bidirectional encoding, attention, and techniques to exploit the hierarchical structure of text. Many of these DL techniques are new to the field of AOB detection and systematic comparisons of their potential to raise detection accuracy are, to our best knowledge, not available in prior research. Further, we extend hierarchical DL models and introduce a pseudo-sentence hierarchical attention network. We investigate the potential of deep NLP transfer learning for AOB detection by considering transformer-based language models such as BERT in our analysis. Finally, we propose the usage of the LIME framework developed by Ribeiro et al. [12] as a final stage of AOB detection. This framework provides interpretability of the model's underlying logic, which might help moderators to decide whether to filter a post. Machine learning-based systems often reflect existing demographic biases [13], which might lead to “unfair” decisions. Interpretability also ensures that model predictions can be checked for possible unintended bias, which would require adjustment or revision of the detection model. All codes used for the experiment are available on Github at https://github.com/QuantLet/AOBDL_code. Moreover, the reader can find an online appendix containing details on parameter tuning, used DL architectures, and additional literature on AOB detection at https://github.com/QuantLet/AOBDL_code/blob/master/AOBDL_Online_Appendix.pdf.

Section snippets

Related work

In only a few years, DL methods have revolutionized the fields of computer vision and NLP, in which they can now be considered a quasi-standard [14]. Recently, a few DL-based approaches have appeared in the decision support literature. We review corresponding research below and distinguish between approaches that support decision-making based on analyzing structured versus unstructured data. This is to sketch the status-quo of DL-based decision support (DS). Thereafter, we review prior work on

Methodology

To fully appreciate the technical content, the reader might benefit from the following overview of ML technologies. In Fig. 1, we summarize our motivation on what machine learning approaches to include. The figure shows different methods and their drawbacks, which can be handled by more complex models.

We start with methods of TML, and as mentioned in the introduction, these methods heavily rely on handcrafted features, whereas DL models help to learn abstract data representations and extract

Dataset

The first data set used for the experiments comes from a Kaggle competition “Toxic Comment Classification”1. This competition is dedicated to the identification of different levels of toxicity in the Wikipedia Talk Pages. The second data set is retrieved from Twitter and was created by Davidson et al. [53], where the authors used it for automatic hate-speech detection. The third dataset is the English and Hindi data from

TML vs. DL

As a first experiment, we compare TML methods with CNN and GRU, two basic DNN architectures, which many more sophisticated models are based on. To that end, we select TML methods that have been used frequently in the AOB literature, including support vector machines (SVM), logistic regression with l2 regularization (LR), and random forest (RF) [e.g., [9], [36]]. Moreover, we consider gradient boosting (LightGBM) due to this model's good performance in prediction benchmarks. Finally, we consider

Conclusion and further work

Detection and prevention of AOB in online content have become an essential problem for social welfare and companies that provide platforms where user-generated content is shared. Manual monitoring of such behavior can be very costly and time-consuming. On the other hand, the absence of moderation can lead to regulatory consequences. This is why support systems that screen user-generated text content and identify cases that warrant manual inspection are of high importance. DL methods are a

Declaration of Competing Interest

None.

Acknowledgements

Financial support from the Deutsche Forschungsgemeinschaft via the IRTG 1792 “High Dimensional Nonstationary Time Series”, Humboldt-Universität zu Berlin, is gratefully acknowledged.

Elizaveta Zinovyeva is a PhD student of of the International Research Training Group IRTG1792 “High dimensional nonstationary time series” at the Humboldt-Universität zu Berlin. Previously she has completed her Master's studies in Information Systems and Bachelor's studies in Business Administration at the Humboldt-Universität zu Berlin. Her research focuses on application of deep neural networks on sequential data.

References (58)

  • N.N. Vo et al.

    Deep learning for decision making and the optimization of socially responsible investments and portfolio

    Decis. Support. Syst.

    (2019)
  • P. Hájek

    Municipal credit rating modelling by neural networks

    Decis. Support. Syst.

    (2011)
  • K. Coussement et al.

    A comparative analysis of data preparation algorithms for customer churn prediction: a case study in the telecommunication industry

    Decis. Support. Syst.

    (2017)
  • A. De Caigny et al.

    Leveraging fine-grained transaction data for customer life event predictions

    Decis. Support. Syst.

    (2020)
  • M. Kraus et al.

    Deep learning in business analytics and operations research: models, applications and managerial implications

    Eur. J. Oper. Res.

    (2020)
  • A.L. Loureiro et al.

    Exploring the use of deep neural networks for sales forecasting in fashion retail

    Decis. Support. Syst.

    (2018)
  • D. Koehn et al.

    Predicting online shopping behaviour from clickstream data using deep learning

    Expert Syst. Appl.

    (2020)
  • T. Fischer et al.

    Deep learning with long short-term memory networks for financial market predictions

    Eur. J. Oper. Res.

    (2018)
  • X. Bai

    Predicting consumer sentiments from online text

    Decis. Support. Syst.

    (2011)
  • M. Cecchini et al.

    Making words work: using financial text as a predictor of financial events

    Decis. Support. Syst.

    (2010)
  • M. Kraus et al.

    Decision support from financial disclosures with deep neural networks and transfer learning

    Decis. Support. Syst.

    (2017)
  • D. Zhu et al.

    Unsupervised tip-mining from customer reviews

    Decis. Support. Syst.

    (2018)
  • B. Kratzwald et al.

    Deep learning for affective computing: text-based emotion recognition in decision support

    Decis. Support. Syst.

    (2018)
  • Y. Wang et al.

    Leveraging deep learning with lda-based text analytics to detect automobile insurance fraud

    Decis. Support. Syst.

    (2018)
  • Ž. Deljac et al.

    Early detection of network element outages based on customer trouble calls

    Decis. Support. Syst.

    (2015)
  • O. Ivanova et al.

    How can online marketplaces reduce rating manipulation? A new approach on dynamic aggregation of online ratings

    Decis. Support. Syst.

    (2017)
  • H. Dutta et al.

    A system for intergroup prejudice detection: the case of microblogging under terrorist attacks

    Decis. Support. Syst.

    (2018)
  • N. Mahmoudi et al.

    Deep neural networks understand investors better

    Decis. Support. Syst.

    (2018)
  • J.W. Patchin

    2016 Cyberbullying Data

    (2016)
  • Cited by (40)

    • Offensive language identification in dravidian languages using MPNet and CNN

      2023, International Journal of Information Management Data Insights
      Citation Excerpt :

      However, these platforms have also become spaces where people are targeted, defamed, and marginalised based on their physical appearance, religion, sexual orientation, and many other factors (Benikova, Wojatzki, & Zesch, 2018; Keipi, Näsi, Oksanen, & Räsänen, 2016; Pamungkas, Basile, & Patti, 2020). Social media has developed into a specialised instrument for verbally threatening and cornering people, not based on their actions but on their identities (Maitra & McGowan, 2012; Patton, Eschmann, & Butler, 2013; Zinovyeva, Hrdle, & Lessmann, 2020). The depth and breadth of this ‘digital marvel’ have enabled previously ‘invisible and socially paralysed’ populations to participate in social discourses (Barnidge, Kim, Sherrill, Luknar, & Zhang, 2019).

    • Online offensive behaviour in socialmedia: Detection approaches, comprehensive review and future directions

      2023, Entertainment Computing
      Citation Excerpt :

      The disadvantage of this paper is that it did not consider optimization techniques to enhance the results. Zinovyeva et al. [68] have proposed utilizing the Local Interpretable Model Agnostic Explanations (LIME) framework to detect antisocial online behaviour. The framework gave the underlying logic interpretability of the model, which might help decide whether or not to filter a post.

    View all citing articles on Scopus

    Elizaveta Zinovyeva is a PhD student of of the International Research Training Group IRTG1792 “High dimensional nonstationary time series” at the Humboldt-Universität zu Berlin. Previously she has completed her Master's studies in Information Systems and Bachelor's studies in Business Administration at the Humboldt-Universität zu Berlin. Her research focuses on application of deep neural networks on sequential data.

    Wolfgang Karl Härdle attained his Dr. rer. nat. in Mathematics at Universität Heidelberg in 1982 and in 1988 his habilitation at Universität Bonn. He is Ladislaus von Bortkiewicz Professor of Statistics at Humboldt-Universität zu Berlin and the director of the Sino German International Research Training Group IRTG1792 “High dimensional nonstationary time series”, a joint project with WISE, Xiamen University.

    His research focuses on data sciences, dimension reduction and quantitative finance. He has published over 30 books and more than 300 papers in top statistical, econometrics and finance journals. He is highly ranked and cited on Google Scholar, REPEC and SSRN. He has professional experience in financial engineering, smart (specific, measurable, achievable, relevant, timely) data analytics, machine learning and cryptocurrency markets.

    Stefan Lessmann received a diploma in business administration and a PhD from the University of Hamburg in 2002 and 2007, respectively. Stefan worked as a lecturer and senior lecture in business informatics at the Institute of Information Systems of the University of Hamburg. Since 2008, Stefan is a guest lecturer at the School of Management of University of Southampton, where he teaches under- and postgraduate courses on quantitative methods, electronic business, and web application development. Stefan completed his habilitation in the area of predictive analytics in 2012. In 2014, Stefan joined the Humboldt-University of Berlin, where he heads the Chair of Information Systems at the School of Business and Economics. Stefan published several papers in leading international journals and conferences, including the European Journal of Operational Research, the IEEE Transactions of Software Engineering, and the International Conference on Information Systems. He actively participates in knowledge transfer and consulting projects with industry partners; from small start-up companies to global players.

    View full text