A Bi-GRU with attention and CapsNet hybrid model for cyberbullying detection on social media,World Wide Web

当前位置： X-MOL 学术 › World Wide Web › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

A Bi-GRU with attention and CapsNet hybrid model for cyberbullying detection on social media
World Wide Web ( IF 3.7 ) Pub Date : 2021-07-22 , DOI: 10.1007/s11280-021-00920-4
Akshi Kumar ₁ , Nitin Sachdeva ₁

Affiliation

As a constructive mode of information sharing, collaboration and communication, social media platforms offer users with limitless opportunities. The same hypermedia can be transposed into a synthetic and toxic milieu that provides an anonymous, destructive pedestal for online bullying and harassment. Automatic cyberbullying detection on social media using synthetic or real-world datasets is one of a proverbial natural language processing problem. Analyzing a given text requires capturing the existent semantics, syntactic and spatial relationships. Learning representative features automatically using deep learning models efficiently captures the contextual semantics and word order arrangement to build robust and superlative predictive models. This work puts forward a hybrid model, Bi-GRU-Attention-CapsNet (Bi-GAC), that benefits by learning sequential semantic representations and spatial location information using a Bi-GRU with self-attention followed by CapsNet for cyberbullying detection in the textual content of social media. The proposed Bi-GAC model is evaluated for performance using F1-score and ROC-AUC curve as metrics. The results show a superior performance to the existing techniques on the benchmark Formspring.me and MySpace datasets. In comparison to the conventional models, an improvement of nearly 9% and 3% in F-score is observed for MySpace and Formspring.me dataset respectively.

中文翻译：

用于社交媒体网络欺凌检测的具有注意力和 CapsNet 混合模型的 Bi-GRU

作为一种信息共享、协作和交流的建设性模式，社交媒体平台为用户提供了无限的机会。同样的超媒体可以转换成一个合成和有毒的环境，为在线欺凌和骚扰提供一个匿名的、破坏性的基础。使用合成或真实世界数据集对社交媒体进行自动网络欺凌检测是众所周知的自然语言处理问题之一。分析给定的文本需要捕获现有的语义、句法和空间关系。使用深度学习模型自动学习代表性特征可以有效地捕获上下文语义和词序排列，以构建强大且最高级的预测模型。这项工作提出了一个混合模型，Bi-GRU-Attention-CapsNet (Bi-GAC)，通过使用具有自我注意的 Bi-GRU 和 CapsNet 学习顺序语义表示和空间位置信息，然后使用 CapsNet 来学习社交媒体文本内容中的网络欺凌检测，从而受益。使用 F1-score 和 ROC-AUC 曲线作为指标评估所提出的 Bi-GAC 模型的性能。结果表明，在基准 Formspring.me 和 MySpace 数据集上，性能优于现有技术。与传统模型相比，MySpace 和 Formspring.me 数据集的 F 分数分别提高了近 9% 和 3%。使用 F1-score 和 ROC-AUC 曲线作为指标评估所提出的 Bi-GAC 模型的性能。结果表明，在基准 Formspring.me 和 MySpace 数据集上，性能优于现有技术。与传统模型相比，MySpace 和 Formspring.me 数据集的 F 分数分别提高了近 9% 和 3%。使用 F1-score 和 ROC-AUC 曲线作为指标评估所提出的 Bi-GAC 模型的性能。结果表明，在基准 Formspring.me 和 MySpace 数据集上，性能优于现有技术。与传统模型相比，MySpace 和 Formspring.me 数据集的 F 分数分别提高了近 9% 和 3%。

更新日期：2021-07-22

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>