Workshop on Arabic Natural Language Processing

BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification



Abstract

Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) model that tackles both country-level and province-level MSA/DA identification. This MTL model consists of a shared Bidirectional Encoder Representations from Transformers (BERT) encoder, two task-specific attention layers, and two classifiers. Our key idea is to leverage both the task-discriminative and the inter-task shared features for country- and province-level MSA/DA identification. The obtained results show that our MTL model outperforms single-task models on most subtasks.
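The architecture described above can be sketched structurally: a shared encoder produces token-level representations, each task applies its own attention pooling, and a task-specific classifier scores the pooled vector. The NumPy sketch below is illustrative only, with a random stand-in for the shared BERT encoder output; the hidden size and class counts (21 countries, 100 provinces) are assumed for the example, and no trained weights are involved.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_pool(H, w):
    """Task-specific attention pooling.
    H: (seq_len, hidden) shared encoder states; w: (hidden,) attention vector.
    Returns a (hidden,) weighted sum of the token states."""
    scores = softmax(H @ w)   # (seq_len,) attention weights
    return scores @ H         # (hidden,) pooled representation

class MultiTaskHead:
    """Structural sketch of the MTL head: one shared encoder output feeds
    two task-specific attention layers and two classifiers
    (country-level and province-level). Weights are random placeholders."""
    def __init__(self, hidden, n_country, n_province, seed=0):
        rng = np.random.default_rng(seed)
        self.w_country = rng.normal(size=hidden)            # attention vector, task 1
        self.w_province = rng.normal(size=hidden)           # attention vector, task 2
        self.W_country = rng.normal(size=(hidden, n_country))
        self.W_province = rng.normal(size=(hidden, n_province))

    def forward(self, H):
        """H: (seq_len, hidden) stand-in for shared BERT encoder states."""
        c = attention_pool(H, self.w_country)
        p = attention_pool(H, self.w_province)
        return softmax(c @ self.W_country), softmax(p @ self.W_province)

# Usage with a random stand-in for encoder output:
rng = np.random.default_rng(1)
H = rng.normal(size=(16, 32))                 # 16 tokens, hidden size 32
head = MultiTaskHead(hidden=32, n_country=21, n_province=100)
country_probs, province_probs = head.forward(H)
```

Because the encoder is shared while the attention layers and classifiers are task-specific, the gradients from both losses flow into the same BERT parameters during training, which is what lets the model exploit inter-task shared features.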
