CyberBERT: BERT for cyberbullying identification,Multimedia Systems

当前位置： X-MOL 学术 › Multimedia Syst. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

CyberBERT: BERT for cyberbullying identification
Multimedia Systems ( IF 3.9 ) Pub Date : 2020-11-11 , DOI: 10.1007/s00530-020-00710-4
Sayanta Paul , Sriparna Saha

Cyberbullying can be delineated as a purposive and recurrent act, which is aggressive in nature, done via different social media platforms such as Facebook, Twitter, Instagram, and others. A state-of-the-art pre-training language model, BERT (Bidirectional Encoder Representations from Transformers), has achieved remarkable results in many language understanding tasks. In this paper, we present a novel application of BERT for cyberbullying identification. A straightforward classification model using BERT is able to achieve the state-of-the-art results across three real-world corpora: Formspring ( $$\sim 12\hbox {k}$$ posts), Twitter ( $$\sim 16\hbox {k}$$ posts), and Wikipedia ( $$\sim 100\hbox {k}$$ posts). Experimental results demonstrate that our proposed model achieves significant improvements over existing works, in comparison with the slot-gated or attention-based deep neural network models.

中文翻译：

CyberBERT：用于网络欺凌识别的 BERT

网络欺凌可以被描述为一种有目的的、经常性的行为，这种行为本质上是侵略性的，通过 Facebook、Twitter、Instagram 等不同的社交媒体平台进行。最先进的预训练语言模型 BERT（来自 Transformers 的双向编码器表示）在许多语言理解任务中取得了显著成果。在本文中，我们提出了 BERT 用于网络欺凌识别的新应用。使用 BERT 的简单分类模型能够在三个真实世界语料库中获得最先进的结果：Formspring ( $$\sim 12\hbox {k}$$ posts)、Twitter ( $$\sim 16 \hbox {k}$$ 帖子）和维基百科（ $$\sim 100\hbox {k}$$ 帖子）。实验结果表明，我们提出的模型比现有工作取得了显着的改进，

更新日期：2020-11-11

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>