Modeling Incongruity between Modalities for Multimodal Sarcasm Detection,IEEE Multimedia

当前位置： X-MOL 学术 › IEEE Multimed. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Modeling Incongruity between Modalities for Multimodal Sarcasm Detection
IEEE Multimedia ( IF 2.3 ) Pub Date : 2021-03-26 , DOI: 10.1109/mmul.2021.3069097
Yang Wu ₁ , Yanyan Zhao ₁ , Xin Lu ₁ , Bing Qin ₁ , Yin Wu ₂ , Jian Sheng ₂ , Jinlong Li ₂

Affiliation

Sarcasm is a sophisticated linguistic phenomenon and commonly manifests on social media platforms, which poses a great challenge for opinion mining systems. Therefore, multimodal sarcasm detection, which aims to understand the implied sentiment in the video, has gained more and more attention. However, previous works mostly focus on multimodal feature fusion without explicitly modeling the incongruity between modalities, such as expressing verbal compliments while rolling eyes, which is an obvious cue for detecting sarcasm. In this article, we propose the incongruity-aware attention network (IWAN), which detects sarcasm by focusing on the word-level incongruity between modalities via a scoring mechanism. This scoring mechanism could assign larger weights to words with incongruent modalities. Experimental results demonstrate the effectiveness of our proposed IWAN model, which not only achieves the state-of-the-art performance on the MUStARD dataset but also offers the advantages of interpretability.

中文翻译：

多模态讽刺检测模态之间的建模不一致

讽刺是一种复杂的语言现象，通常出现在社交媒体平台上，这对意见挖掘系统提出了巨大挑战。因此，旨在理解视频中隐含情感的多模态讽刺检测越来越受到关注。然而，以前的工作主要集中在多模态特征融合，而没有明确地对模态之间的不协调进行建模，例如在翻白眼的同时表达口头赞美，这是检测讽刺的明显线索。在本文中，我们提出了不一致感知注意力网络（IWAN），它通过评分机制关注模态之间的词级不一致来检测讽刺。这种评分机制可以为形式不一致的单词分配更大的权重。

更新日期：2021-03-26

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11