首页> 外文会议>Annual meeting of the Association for Computational Linguistics >A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling
【24h】

A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling

机译:用于低资源序列标记的多语言多任务架构

获取原文

摘要

We propose a multi-lingual multi-task architecture to develop supervised models with a minimal amount of labeled data for sequence labeling. In this new architecture, we combine various transfer models using two layers of parameter sharing. On the first layer, we construct the basis of the architecture to provide universal word representation and feature extraction capability for all models. On the second level, we adopt different parameter sharing strategies for different transfer schemes. This architecture proves to be particularly effective for low-resource settings, when there are less than 200 training sentences for the target task. Using Name Tagging as a target task, our approach achieved 4.3%-50.5% absolute F-score gains compared to the mono-lingual single-task baseline model.
机译:我们提出了一种多语言多任务架构,用于开发具有最小数量的序列标记的标记数据的监督模型。在这一新架构中,我们使用两层参数共享组合各种传输模型。在第一层上,我们构建架构的基础,为所有模型提供通用字表示和特征提取功能。在二级,我们采用不同的参数共享策略进行不同的转移方案。当目标任务的训练句不到200个培训句子时,这种架构对低资源设置表示特别有效。使用名称标记为目标任务,与单语言单任务基线模型相比,我们的方法实现了4.3%-50.5%的绝对F-Score收益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号