首页> 外文会议>Conference on empirical methods in natural language processing >Comparative Studies of Detecting Abusive Language on Twitter
【24h】

Comparative Studies of Detecting Abusive Language on Twitter

机译:检测推特滥用语言的比较研究

获取原文

摘要

The context-dependent nature of online aggression makes annotating large collections of data extremely difficult. Previously studied datasets in abusive language detection have been insufficient in size to efficiently train deep learning models. Recently, Hate and Abusive Speech on Twitter, a dataset much greater in size and reliability, has been released. However, this dataset has not been comprehensively studied to its potential. In this paper, we conduct the first comparative study of various learning models on Hate and Abusive Speech on Twitter, and discuss the possibility of using additional features and context data for improvements. Experimental results show that bidirectional GRU networks trained on word-level features, with Latent Topic Clustering modules, is the most accurate model scoring 0.805 F1.
机译:在线侵略的上下文相关性使注释大量数据集非常困难。以前研究的滥用语言检测的数据集不足以有效地培训深度学习模型。最近,在Twitter上讨厌和滥用言论,数据集的规模和可靠性要大得多,已经发布。但是,该数据集尚未全面研究其潜力。在本文中,我们对Twitter上的仇恨和滥用语音进行各种学习模型进行了第一个比较研究,并讨论了使用其他功能和上下文数据进行改进的可能性。实验结果表明,在单词级功能上培训的双向GRU网络,具有潜在主题聚类模块,是最精确的模型评分0.805 F1。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号