【24h】

UVA Wahoos at SemEval-2019 Task 6: Hate Speech Identification using Ensemble Machine Learning

机译:UVA Wahoos在SemEval-2019任务6:使用Ensemble Machine Learning进行讨厌的语音识别

获取原文

摘要

With the growth in the usage of social media, it has become increasingly common for people to hide behind a mask and abuse others. We have attempted to detect such tweets and comments that are malicious in intent, which either targets an individual or a group. Our best classifier for identifying offensive tweets for SubTask_A (Classifying offensive vs. non-offensive) has an accuracy of 83.14% and a f1-score of 0.7565 on the actual test data. For SubTask_B, to identify if an offensive tweet is targeted (If targeted towards an individual or a group), the classifier performs with an accuracy of 89.17% and f1-score of 0.5885. The paper talks about how we generated linguistic and semantic features to build an ensemble machine learning model. By training with more extracts from different sources (Face-book, and more tweets), the paper shows how the accuracy changes with additional training data.
机译:随着社交媒体使用的增长,人们躲在面具下并虐待他人变得越来越普遍。我们已经尝试检测到针对个人或群体的恶意推文和评论。我们为SubTask_A识别攻击性推文的最佳分类器(对攻击性和非攻击性进行分类)的准确度为83.14%,实际测试数据的f1-分数为0.7565。对于SubTask_B,要识别攻击性鸣叫是否是针对性的(如果针对个人或团体),分类器的准确度为89.17%,f1得分为0.5885。本文讨论了如何生成语言和语义特征以构建整体机器学习模型。通过使用来自不同来源的更多摘录(Face-book和更多推文)进行训练,本文显示了如何通过附加训练数据来改变准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号