首页> 外文会议>Workshop on Arabic Natural Language Processing >LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier
【24h】

LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier

机译:在NADI共享任务1:使用堆叠分类器的阿拉伯语方言识别

获取原文

摘要

This paper presents our results for the Nuanced Arabic Dialect Identification (NADI) shared task of the Fifth Workshop for Arabic Natural Language Processing (WANLP 2020). We participated in the first sub-task for country-level Arabic dialect identification covering 21 Arab countries. Our contribution is based on a stacking classifier using Multinomial Naive Bayes,Linear SVC,and Logistic Regression classifiers as estimators; followed by a Logistic Regression as final estimator. Despite the fact that the results on the test set were low,with a macro F1 of 17.71,we were able to show that a simple approach can achieve comparable results to more sophisticated solutions. Moreover,the insights of our error analysis,and of the corpus content in general,can be used to develop and improve future systems.
机译:本文介绍了阿拉伯语自然语言处理第五次研讨会(WANLP 2020)第五次研讨会的分享任务的结果。 我们参加了涵盖21个阿拉伯国家的国家级阿拉伯语方言识别的第一个子任务。 我们的贡献基于使用多元幼稚贝叶斯,线性SVC和Logistic回归分类器作为估计器的堆叠分类器; 其次是作为最终估算者的逻辑回归。 尽管测试集的结果低,但宏F1为17.71,我们能够表明简单的方法可以实现更复杂的解决方案的可比结果。 此外,我们的错误分析和常规语料库内容的见解可用于开发和改进未来的系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号