首页> 外文会议>Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies >NLPR@SRPOL at SemEval-2019 Task 6: Linguistically enhanced deep learning offensive sentence classifier
【24h】

NLPR@SRPOL at SemEval-2019 Task 6: Linguistically enhanced deep learning offensive sentence classifier

机译:NLPR @ SRPOL在Semeval-2019任务6:语言上增强深入学习进攻句子分类器

获取原文

摘要

The paper presents a system developed for the SemEval-2019 competition Task 5 hat-Eval Basile et al. (2019) (team name: LU Team) and Task 6 OffensEval Zampieri et al. (2019b) (team name: NLPR@SRPOL), where we achieved 2nd position in Subtask C. The system combines in an ensemble several models (LSTM, Transformer, OpenAI's GPT, Random forest, SVM) with various embeddings (custom, ELMo, fastText, Universal Encoder) together with additional linguistic features (number of blacklisted words, special characters, etc.). The system works with a multi-tier blacklist and a large corpus of crawled data, annotated for general offensiveness. In the paper we do an extensive analysis of our results and show how the combination of features and embedding affect the performance of the mod-els.
机译:本文提出了一个为Semeval-2019竞争任务5 Hat-Evg Basile等人开发的系统。 (2019年)(团队名称:LU队)和任务6 offenseval Zampieri等。 (2019B)(团队名称:NLPR @ SRPOL),我们在SubTask C中实现了第二个位置。该系统在集合的几种型号(LSTM,变压器,OpenAI的GPT,随机林,SVM)中结合了各种嵌入式(定制,Elmo, FastText,Universal Encoder)以及附加语言特征(黑名单单词,特殊字符等)。该系统适用于多层黑名单和大型爬网数据的语料库,用于一般冒险性。在论文中,我们对我们的结果进行了广泛的分析,并展示了特征和嵌入的结合如何影响Mod-ELS的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号