Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text

机译：多模式模因数据集（MultiOFF），用于识别图像和文本中令人反感的内容

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A meme is a form of media that spreads an idea or emotion across the internet. As posting meme has become a new form of communication of the web, due to the multimodal nature of memes, postings of hateful memes or related events like trolling, cyberbullying are increasing day by day. Hate speech, offensive content and aggression content detection have been extensively explored in a single modality such as text or image. However, combining two modalities to detect offensive content is still a developing area. Memes make it even more challenging since they express humour and sarcasm in an implicit way, because of which the meme may not be offensive if we only consider the text or the image. Therefore, it is necessary to combine both modalities to identify whether a given meme is offensive or not. Since there was no publicly available dataset for multimodal offensive meme content detection, we leveraged the memes related to the 2016 U.S. presidential election and created the MultiOFF multimodal meme dataset for offensive content detection dataset. We subsequently developed a classifier for this task using the MultiOFF dataset. We use an early fusion technique to combine the image and text modality and compare it with a text- and an image-only baseline to investigate its effectiveness. Our results show improvements in terms of Precision, Recall, and F-Score.

机译：模因是在互联网上传播想法或情感的一种媒体形式。由于模因的多模式性质，发布模因已经成为网络通信的一种新形式，仇恨模因或相关事件（例如拖钓，网络欺凌）的发布正日益增多。仇恨言论，攻击性内容和攻击性内容检测已在单一形式（例如文本或图像）中进行了广泛探索。但是，结合两种方式来检测令人反感的内容仍然是一个发展中的领域。由于模因以隐式方式表达幽默和讽刺，模因使其更具挑战性，因此，如果仅考虑文字或图像，则模因可能不会令人反感。因此，有必要结合两种方式来识别给定的模因是否令人反感。由于没有公开的多模态攻击性模因内容检测数据集，我们利用与2016年美国总统大选相关的模因，为攻击性内容检测数据集创建了MultiOFF多模态模因数据集。随后，我们使用MultiOFF数据集为此任务开发了分类器。我们使用一种早期的融合技术将图像和文本模态进行组合，并将其与仅文本和仅图像的基线进行比较，以研究其有效性。我们的结果表明，在精度，召回率和F得分方面均得到了改善。

著录项

来源
《Workshop on Trolling, Aggression and Cyberbullying》|2020年|32-41|共10页
会议地点
作者
Shardul Suryawanshi; Bharathi Raja Chakravarthi; Mihael Arcan; Paul Buitelaar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
multimodal data; classification; memes; offensive content; opinion mining;

机译：多峰数据分类;模因令人反感的内容;意见挖掘;

相似文献

外文文献
中文文献
专利

1. Self-supervised multimodal reconstruction of retinal images over paired datasets [J] . Hervella Alvaro S., Rouco Jose, Novo Jorge, Expert Systems with Application . 2020,第Deca期

机译：复合数据集的自我监督多峰重建视网膜图像
2. Textural and Geometrical Features Based Approach for Identification of Individuals Using Palmprint and Hand Shape Images from Multiple Multimodal Datasets [J] . Shaukat Anum, Farhan Saima, Fahiem Muhammad Abuzar, Journal of testing and evaluation . 2018,第6期

机译：基于纹理和几何特征的个人识别方法，使用多个多峰数据集中的掌纹和手形图像进行识别
3. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images [J] . Asghar Ali Chandio, Md. Asikuzzaman, Mark Pickering, Data in Brief . 2020,第3期

机译：Cursive-Text：自然场景图像中的端到端核心文本识别的全面数据集
4. News2meme: An Automatic Content Generator from News Based on Word Subspaces from Text and Image [C] . Erica K. Shimomoto, Lincon S. Souza, Bernardo B. Gatto, Proceedings of the Sixteenth International Conference on Machine Vision Applications . 2019

机译：News2meme：来自文本和图像的Word子空间的新闻自动内容生成器
5. Multidataset Independent Subspace Analysis: A Framework for Analysis of Multimodal, Multi-Subject Brain Imaging Data [D] . Silva, Rogers F. 2017

机译：独立于多数据集的子空间分析：一种用于分析多模态，多主体脑成像数据的框架
6. Towards Answering Biological Questions with Experimental Evidence: Automatically Identifying Text that Summarize Image Content in Full-Text Articles [O] . Hong Yu 2006

机译：尝试用实验证据回答生物学问题：自动识别全文文章中包含图像内容的文本
7. Figure 4: (A) One conserved sequence, which occurs 79 times in 46,264 binding site peaks from the ChIP-seq data-set. The mutation profile of this conserved sequence is illustrated, where ’_ ’ indicates this base is unchanged; DEL indicates this base is lost; INS X indicates a new base X is inserted in front of this base. (B) Several repeated elements patterns are listed. (C) In the first column, the top five DNA motifs, mined by meme-chip tools (Machanick Bailey, 2011) are illustrated. The resemblant conserved sequences, found by the CFSP algorithm are listed in the second column. In the third column, the position-specific scoring matrices, which are transformed from mutational information are listed. The similarity between meme motif and resemblant conserved sequence with PSSM format was calculated via a stamp motif comparison tool (Mahony Benos, 2007). The E-values for the similarity of those pairs is displayed in the fourth column. (D) One motif is selected in each group clustered by gkmsvm descriptors, and the corresponding motif found by the CFSP algorithm is listed below. (E) There are additional datasets (File No: ENCFF100GRL, ENCFF616IRT, ENCFF870CER, Target: SREBF1) collected from https://www.encodeproject.org. The top two motifs are selected in each file using meme tools, and the corresponding motifs found by our algorithm are listed below. [O] . -1

机译：图4：（a）一种保守序列，其发生在芯片-SEQ数据集中的46,264个结合位点峰值中的79倍。说明了这种保守序列的突变分布，其中'_'表示该碱度不变; del表示此基础丢失; INS X表示新的基础X插入此基础前面。（b）列出了几种重复的元素模式。（c）在第一栏中，示出了由MEME芯片工具（Machanick＆Bailey，2011）开采的前五个DNA主题。由CFSP算法发现的相应保守序列列于第二列中。在第三列中，列出了从突变信息转换的特定位置的评分矩阵。 MEME主题与PSSM格式的相似性与PSSM格式之间的相似性通过邮票图章比较工具（Mahony＆Benos，2007）计算。这些对相似性的电子值显示在第四列中。（d）在由GKMSVM描述符聚集的每个组中选择了一个图案，下面列出了CFSP算法的相应主题。（e）从https://www.encodeproject.org收集的，有附加数据集（文件no：cernff100grl，cenf616irl，conf8.20cer，target：srebf1）。使用MEME工具在每个文件中选择前两个图案，并且我们的算法发现的相应主题如下所示。

Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text

摘要

著录项

相似文献

相关主题

期刊订阅