【24h】

NELEC at SemEval-2019 Task 3: Think Twice Before Going Deep

机译:NELEC在SemEval-2019任务3:深入研究之前先三思

获取原文

摘要

Existing Machine Learning techniques yield close to human performance on text-based classification tasks. However, the presence of multi-modal noise in chat data such as emoti-cons, slang, spelling mistakes, code-mixed data, etc. makes existing deep-learning solutions perform poorly. The inability of deep-learning systems to robustly capture these covariates puts a cap on their performance. We propose NELEC : Neural and Lexical Combiner, a system which elegantly combines textual and deep-learning based methods for sentiment classification. We evaluate our system as part of the third task of 'Contextual Emotion Detection in Text' as part of SemEval-2019 (Chatterjee et al., 2019b). Our system performs significantly better than the baseline, as well as our deep-learning model benchmarks. It achieved a micro-averaged F_1 score of 0.7765, ranking 3rd on the test-set leader-board.
机译:现有的机器学习技术在基于文本的分类任务上的表现接近人类。但是,聊天数据中的多模式噪声(如表情符号,语,拼写错误,代码混合数据等)的存在使现有的深度学习解决方案性能较差。深度学习系统无法强大地捕获这些协变量,从而限制了它们的性能。我们建议使用NELEC:神经和词汇组合器,该系统将基于文本和深度学习的方法完美地结合在一起,用于情感分类。我们将系统评估为SemEval-2019(Chatterjee et al。,2019b)的第三个任务“文本中的情境情感检测”的一部分。我们的系统的性能明显优于基线以及我们的深度学习模型基准。它的微平均F_1得分为0.7765,在测试集排行榜上排名第三。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号