LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier

机译：在NADI共享任务1：使用堆叠分类器的阿拉伯语方言识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents our results for the Nuanced Arabic Dialect Identification (NADI) shared task of the Fifth Workshop for Arabic Natural Language Processing (WANLP 2020). We participated in the first sub-task for country-level Arabic dialect identification covering 21 Arab countries. Our contribution is based on a stacking classifier using Multinomial Naive Bayes,Linear SVC,and Logistic Regression classifiers as estimators; followed by a Logistic Regression as final estimator. Despite the fact that the results on the test set were low,with a macro F1 of 17.71,we were able to show that a simple approach can achieve comparable results to more sophisticated solutions. Moreover,the insights of our error analysis,and of the corpus content in general,can be used to develop and improve future systems.

机译：本文介绍了阿拉伯语自然语言处理第五次研讨会（WANLP 2020）第五次研讨会的分享任务的结果。我们参加了涵盖21个阿拉伯国家的国家级阿拉伯语方言识别的第一个子任务。我们的贡献基于使用多元幼稚贝叶斯，线性SVC和Logistic回归分类器作为估计器的堆叠分类器; 其次是作为最终估算者的逻辑回归。尽管测试集的结果低，但宏F1为17.71，我们能够表明简单的方法可以实现更复杂的解决方案的可比结果。此外，我们的错误分析和常规语料库内容的见解可用于开发和改进未来的系统。

著录项

来源
《Workshop on Arabic Natural Language Processing》|2020年|313-319|共7页
会议地点
作者
Samia Touileb;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 13:58:09

相似文献

外文文献
中文文献
专利

1. Compression versus traditional machine learning classifiers to detect code-switching in varieties and dialects: Arabic as a case study [J] . Taghreed Tarmom, William Teahan, Eric Atwell, Natural language engineering . 2020,第Pta6期

机译：压缩与传统机器学习分类器检测品种和方言中的码切换：阿拉伯语作为一个案例研究
2. Classifying Sentiment of Dialectal Arabic Reviews: A Semi-Supervised Approach [J] . Al-Harbi Omar The international arab journal of information technology . 2019,第6期

机译：对方言阿拉伯语评论的情绪进行分类：一种半监督方法
3. Cross-dialectal data sharing for acoustic modeling in Arabic speech recognition [J] . Kirchhoff K, Vergyri D Speech Communication . 2005,第1期

机译：跨方言数据共享，用于阿拉伯语音识别中的声学建模
4. NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task [C] . Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Workshop on Arabic Natural Language Processing . 2021

机译：NADI 2021：第二个细微的阿拉伯语方言识别共享任务
5. Arabic Dialect Identification [D] . Al-Mannai, Kamela Ali 2018

机译：阿拉伯方言识别
6. Morphological structure in the Arabic mental lexicon: Parallels between standard and dialectal Arabic [O] . Sami Boudelaa, William D. Marslen-Wilson -1

机译：阿拉伯语心理词典中的形态结构：标准阿拉伯语与方言阿拉伯语之间的平行
7. Team JUST at the MADAR Shared Task on Arabic Fine-Grained Dialect Identification [O] . Bashar Talafha, Ali Fadel, Mahmoud Al-Ayyoub, 2019

机译：在马尔的队伍同行任务阿拉伯语细粒度方言鉴定

LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier

摘要

著录项

相似文献

相关主题

期刊订阅