当前位置: X-MOL 学术arXiv.cs.CL › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Are Chess Discussions Racist? An Adversarial Hate Speech Data Set
arXiv - CS - Computation and Language Pub Date : 2020-11-20 , DOI: arxiv-2011.10280
Rupak Sarkar, Ashiqur R. KhudaBukhsh

On June 28, 2020, while presenting a chess podcast on Grandmaster Hikaru Nakamura, Antonio Radi\'c's YouTube handle got blocked because it contained "harmful and dangerous" content. YouTube did not give further specific reason, and the channel got reinstated within 24 hours. However, Radi\'c speculated that given the current political situation, a referral to "black against white", albeit in the context of chess, earned him this temporary ban. In this paper, via a substantial corpus of 681,995 comments, on 8,818 YouTube videos hosted by five highly popular chess-focused YouTube channels, we ask the following research question: \emph{how robust are off-the-shelf hate-speech classifiers to out-of-domain adversarial examples?} We release a data set of 1,000 annotated comments where existing hate speech classifiers misclassified benign chess discussions as hate speech. We conclude with an intriguing analogy result on racial bias with our findings pointing out to the broader challenge of color polysemy.

中文翻译:

是国际象棋讨论种族主义者吗?对抗性仇恨言论数据集

2020年6月28日,在中村光大师(Hikaru Nakamura)上展示国际象棋播客时,安东尼奥·雷迪克(Antonio Radic)的YouTube手柄因包含“有害和危险”内容而被封锁。YouTube没有提供进一步的具体原因,该频道已在24小时内恢复。但是,拉迪克推测,鉴于当前的政治局势,尽管是在国际象棋的背景下,转而引用“黑与白”,却使他获得了这一临时禁令。本文通过681995条评论的主体,对由五个非常受国际象棋关注的YouTube频道托管的8,818个YouTube视频进行了调查,提出了以下研究问题:\ emph {现成的仇恨语音分类器对域外对抗示例?}我们发布了一个数据集1,000条带注释的注释,其中现有的仇恨言论分类器将良性国际象棋讨论误分类为仇恨言论。我们得出了一个关于种族偏见的有趣的类比结果,我们的发现指出了色彩多义性的更广泛挑战。
更新日期:2020-11-23
down
wechat
bug