首页> 外文会议>International Workshop on Semantic Evaluation >NLP_Passau at SemEval-2020 Task 12: Multilingual Neural Network for Offensive Language Detection in English, Danish and Turkish
【24h】

NLP_Passau at SemEval-2020 Task 12: Multilingual Neural Network for Offensive Language Detection in English, Danish and Turkish

机译:NLP_Passau在Semeval-2020任务12:多语种神经网络用于英语,丹麦语和土耳其语中的攻击性语言检测

获取原文

摘要

This paper describes a neural network (NN) model that was used for participating in the OffensEval, Task 12 of the SemEval 2020 workshop. The aim of this task is to identify offensive speech in social media, specifically in tweets. The model we used, C-BiGRU, is composed of a Convolutional Neural Network (CNN) along with a bidirectional Recurrent Neural Network (RNN). A multidimensional numerical representation (embedding) for each of the words in the tweets, that were used by the model, was determined using fastText. This was utilized with a dataset of labeled tweets to train the model on detecting combinations of words that may convey an offensive meaning. The model was then used in the sub-task A of the English, Turkish and Danish competitions of the workshop, achieving F1 scores of 90.88%, 76.76% and 76.70%, respectively.
机译:本文介绍了用于参与OffenseVal的神经网络(NN)模型,Semeval 2020车间的任务12。 这项任务的目的是在社交媒体中识别令人反感的演讲,特别是在推文中。 我们使用的模型C-BIGRU由卷积神经网络(CNN)组成,以及双向反复性神经网络(RNN)。 使用FastText确定模型中每个单词的多维数值表示(嵌入),这些数值表示该促装中的每种单词。 这是与标记推文的数据集一起使用,以培训模型检测可能传达令人反感含义的单词组合。 然后将该模型用于研讨会的英语,土耳其和丹麦竞赛的子任务A,分别实现了90.88%,76.76%和76.70%的F1分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号