首页> 外文会议>International Joint Conference on Neural Networks >A Multimodal Classification of Noisy Hate Speech using Character Level Embedding and Attention
【24h】

A Multimodal Classification of Noisy Hate Speech using Character Level Embedding and Attention

机译:基于字符级嵌入和注意的含噪仇恨语音多模态分类

获取原文

摘要

Hate speech has become a critical problem in all social media, leading to many hate crimes alongside affecting the mental and emotional well-being of affected individuals. This calls out for methods to detect online hate speech more than ever. While numerous architectures exist for hate speech detection in unimodal setup (i.e., either textual or visual) we have targeted the problem in the context of both text and images inspired by the real-world raw data which involves several modalities. We propose a multimodal hate speech classifier, called as Character Text Image Classifier (CTIC), which builds upon Bidirectional Encoder Representations from Transformers (BERT), Capsule Network, and EfficientNet involving four modalities, namely word embeddings, character embeddings, sentence embeddings, and images. We report the experiments performed upon our proposed model and several other models which have been tested in the process. We train our model with different sampling techniques, and selective training, upon a Twitter dataset, called MMHS150K, consisting of both texts and associated images. Our proposed multimodal approach attains better performance than the previous models constructed upon the dataset.
机译:仇恨言论已经成为所有社交媒体中的一个关键问题,导致许多仇恨犯罪,同时影响受影响个人的心理和情绪健康。这就需要比以往任何时候都更多的方法来检测在线仇恨言论。虽然在单峰设置(即文本或视觉)中存在许多用于仇恨语音检测的体系结构,但我们在文本和图像的背景下针对这个问题,这些背景都是受现实世界原始数据的启发,涉及多种模式。我们提出了一种多模态仇恨语音分类器,称为字符-文本-图像分类器(CTIC),它基于来自Transformers(BERT)、Capsule Network和EfficientNet的双向编码器表示,涉及四种模式,即单词嵌入、字符嵌入、句子嵌入和图像。我们报告了在我们提出的模型和其他几个模型上进行的实验,这些模型在这个过程中已经过测试。我们在一个名为MMHS150K的Twitter数据集上使用不同的采样技术和选择性训练来训练我们的模型,该数据集由文本和相关图像组成。我们提出的多模态方法比以前基于数据集构建的模型具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号