Machine Learning-Based Approach for Arabic Dialect Identification

机译：基于机器学习的阿拉伯语方言识别方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes our systems submitted to the Second Nuanced Arabic Dialect Identification Shared Task (NADI 2021). Dialect identification is the task of automatically detecting the source variety of a given text or speech segment. There are four subtasks, two sub-tasks for country-level identification and the other two subtasks for province-level identification. The data in this task covers a total of 100 provinces from all 21 Arab countries and come from the Twitter domain. The proposed systems depend on five machine-learning approaches namely Complement Naieve Bayes, Support Vector Machine, Decision Tree, Logistic Regression and Random Forest Classifiers. F_1 macro-averaged score of Naieve Bayes classifier outperformed all other classifiers for development and test data.

机译：本文介绍了我们提交给第二个细微患者阿拉伯语方言识别共享任务的系统（NADI 2021）。方言识别是自动检测给定文本或语音段的源多样的任务。有四个子任务，国家级别标识的两个子任务以及省级识别的其他两个子任务。该任务中的数据涵盖了来自所有21个阿拉伯国家的100个省份，并来自Twitter领域。所提出的系统依赖于五种机器学习方法，即补充贝叶斯，支持向量机，决策树，逻辑回归和随机林分类器。 F_1宏观平均得分明智的贝叶斯分类器优于开发和测试数据的所有其他分类器。

著录项

来源
《Workshop on Arabic Natural Language Processing》|2021年|287-290|共4页
会议地点
作者
Mahmoud S. Ali; Ahmed H. Ali; Ahmed A. El-Sawy; Hamada A. Nayel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 13:58:09

相似文献

外文文献
中文文献
专利

1. Arabic tweeps dialect prediction based on machine learning approach [J] . Khaled Alrifai, Ghaida Rebdawi, Nada Ghneim International Journal of Electrical and Computer Engineering . 2021,第2期

机译：阿拉伯语滴动基于机器学习方法的方言预测
2. Compression versus traditional machine learning classifiers to detect code-switching in varieties and dialects: Arabic as a case study [J] . Taghreed Tarmom, William Teahan, Eric Atwell, Natural language engineering . 2020,第Pta6期

机译：压缩与传统机器学习分类器检测品种和方言中的码切换：阿拉伯语作为一个案例研究
3. Rule-Based Machine Translation from Tunisian Dialect to Modern Standard Arabic [J] . Mohamed Ali Sghaier, Mounir Zrigui Procedia Computer Science . 2020,第5期

机译：基于规则的机器翻译从突尼斯方言到现代标准阿拉伯语
4. Country-level Arabic Dialect Identification Using Small Datasets with Integrated Machine Learning Techniques and Deep Learning Models [C] . Maha J. Althobaiti Workshop on Arabic Natural Language Processing . 2021

机译：国家一级的阿拉伯语方言识别，使用小型数据集具有集成机器学习技术和深度学习模型
5. Machine Translation of Arabic Dialects [D] . Salloum, Wael. 2018

机译：阿拉伯方言的机器翻译
6. A Neural Machine Translation Model for Arabic Dialects That Utilises Multitask Learning (MTL) [O] . Laith H. Baniata, Seyoung Park, Seong-Bae Park 2018

机译：利用多任务学习（MTL）的阿拉伯语神经机器翻译模型
7. Dialectal to Standard Arabic Paraphrasing to Improve Arabic-English Statistical Machine Translation [O] . Salloum Wael Sameer, Habash Nizar Y. 2011

机译：以标准阿拉伯语释义的方言，以改善阿拉伯英语统计机器翻译

Machine Learning-Based Approach for Arabic Dialect Identification

摘要

著录项

相似文献

相关主题

期刊订阅