首页> 外文会议>Workshop on Trolling, Aggression and Cyberbullying >Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text
【24h】

Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text

机译:多模式模因数据集(MultiOFF),用于识别图像和文本中令人反感的内容

获取原文

摘要

A meme is a form of media that spreads an idea or emotion across the internet. As posting meme has become a new form of communication of the web, due to the multimodal nature of memes, postings of hateful memes or related events like trolling, cyberbullying are increasing day by day. Hate speech, offensive content and aggression content detection have been extensively explored in a single modality such as text or image. However, combining two modalities to detect offensive content is still a developing area. Memes make it even more challenging since they express humour and sarcasm in an implicit way, because of which the meme may not be offensive if we only consider the text or the image. Therefore, it is necessary to combine both modalities to identify whether a given meme is offensive or not. Since there was no publicly available dataset for multimodal offensive meme content detection, we leveraged the memes related to the 2016 U.S. presidential election and created the MultiOFF multimodal meme dataset for offensive content detection dataset. We subsequently developed a classifier for this task using the MultiOFF dataset. We use an early fusion technique to combine the image and text modality and compare it with a text- and an image-only baseline to investigate its effectiveness. Our results show improvements in terms of Precision, Recall, and F-Score.
机译:模因是在互联网上传播想法或情感的一种媒体形式。由于模因的多模式性质,发布模因已经成为网络通信的一种新形式,仇恨模因或相关事件(例如拖钓,网络欺凌)的发布正日益增多。仇恨言论,攻击性内容和攻击性内容检测已在单一形式(例如文本或图像)中进行了广泛探索。但是,结合两种方式来检测令人反感的内容仍然是一个发展中的领域。由于模因以隐式方式表达幽默和讽刺,模因使其更具挑战性,因此,如果仅考虑文字或图像,则模因可能不会令人反感。因此,有必要结合两种方式来识别给定的模因是否令人反感。由于没有公开的多模态攻击性模因内容检测数据集,我们利用与2016年美国总统大选相关的模因,为攻击性内容检测数据集创建了MultiOFF多模态模因数据集。随后,我们使用MultiOFF数据集为此任务开发了分类器。我们使用一种早期的融合技术将图像和文本模态进行组合,并将其与仅文本和仅图像的基线进行比较,以研究其有效性。我们的结果表明,在精度,召回率和F得分方面均得到了改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号