首页> 外文会议>Workshop on Trolling, Aggression and Cyberbullying >Towards Non-Toxic Landscapes: Automatic Toxic Comment Detection Using DNN
【24h】

Towards Non-Toxic Landscapes: Automatic Toxic Comment Detection Using DNN

机译:迈向无毒景观:使用DNN的自动有毒评论检测

获取原文

摘要

The spectacular expansion of the Internet has led to the development of a new research problem in the field of natural language processing: automatic toxic comment detection, since many countries prohibit hate speech in public media. There is no clear and formal definition of hate, offensive, toxic and abusive speeches. In this article, we put all these terms under the umbrella of "toxic speech". The contribution of this paper is the design of binary classification and regression-based approaches aiming to predict whether a comment is toxic or not. We compare different unsupervised word representations and different DNN based classifiers. Moreover, we study the robustness of the proposed approaches to adversarial attacks by adding one (healthy or toxic) word. We evaluate the proposed methodology on the English Wikipedia Detox corpus. Our experiments show that using BERT fine-tuning outperforms feature-based BERT, Mikolov's and fastText representations wim different DNN classifiers.
机译:互联网的惊人发展导致自然语言处理领域出现了一个新的研究问题:自动有毒评论检测,因为许多国家禁止公共媒体中的仇恨言论。仇恨,令人反感,有毒和辱骂性言论尚无明确和正式的定义。在本文中,我们将所有这些术语置于“有毒言论”的保护之下。本文的贡献是设计了基于二进制分类和回归的方法,旨在预测评论是否有毒。我们比较了不同的无监督词表示形式和不同的基于DNN的分类器。此外,我们通过添加一个单词(健康或有毒)来研究所提出的对抗攻击方法的鲁棒性。我们在英语维基百科排毒语料库上评估了所提出的方法。我们的实验表明,使用BERT进行微调的效果优于基于功能的BERT,Mikolov的和fastText表示在不同的DNN分类器上实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号