首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Adversarial Multitask Learning for Joint Multi-Feature and Multi-Dialect Morphological Modeling
【24h】

Adversarial Multitask Learning for Joint Multi-Feature and Multi-Dialect Morphological Modeling

机译:对抗性多任务学习,用于联合多特征和多方言形态建模

获取原文

摘要

Morphological tagging is challenging for morphologically rich languages due to the large target space and the need for more training data to minimize model sparsity. Dialectal variants of morphologically rich languages suffer more as they tend to be more noisy and have less resources. In this paper we explore the use of multitask learning and adversarial training to address morphological richness and dialectal variations in the context of full morphological tagging. We use multitask learning for joint morphological modeling for the features within two dialects, and as a knowledge-transfer scheme for cross-dialectal modeling. We use adversarial training to learn dialect invariant features that can help the knowledge-transfer scheme from the high to low-resource variants. We work with two dialectal variants: Modern Standard Arabic (high-resource "dialect"1) and Egyptian Arabic (low-resource dialect) as a case study. Our models achieve state-of-the-art results for both. Furthermore, adversarial training provides more significant improvement when using smaller training datasets in particular.
机译:由于目标空间较大,并且需要更多的训练数据以最大程度地减少模型稀疏性,因此形态学标记对于形态丰富的语言而言是具有挑战性的。形态丰富的语言的方言变体遭受更多的干扰,因为它们往往更嘈杂且资源更少。在本文中,我们探索了使用多任务学习和对抗训练来解决形态学标记完整情况下的形态学丰富性和方言变异。我们将多任务学习用于两个方言内的特征的联合形态学建模,并将其用作跨语言建模的知识转移方案。我们使用对抗训练来学习方言不变性,这些特性可以帮助知识转移方案从高资源变种到低资源变种。我们使用两种方言变体:案例研究:现代标准阿拉伯语(资源丰富的方言)1和埃及阿拉伯语(资源匮乏的方言)。我们的模型都为这两种方法取得了最先进的结果。此外,特别是在使用较小的训练数据集时,对抗训练可提供更大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号