首页> 外文会议>SIGMORPHON workshop on computational research in phonetics, phonology, and morphology >Sigmorphon 2019 Task 2 system description paper: Morphological analysis in context for many languages, with supervision from only a few
【24h】

Sigmorphon 2019 Task 2 system description paper: Morphological analysis in context for many languages, with supervision from only a few

机译:Sigmorphon 2019任务2系统描述论文:许多语言的语境中的形态分析,只有几个语言的监督

获取原文

摘要

This paper presents the UNT HiLT+Ling system for the Sigmorphon 2019 shared Task 2: Morphological Analysis and Lemmatization in Context. Our core approach focuses on the morphological tagging task; part-of-speech tagging and lemmatization are treated as secondary tasks. Given the highly multilingual nature of the task, we propose an approach which makes minimal use of the supplied training data, in order to be extensible to languages without labeled training data for the morphological analysis task. Specifically, we use a parallel Bible corpus to align contextual embeddings at the verse level. The aligned verses are used to build cross-language translation matrices, which in turn are used to map between embedding spaces for the various languages. Finally, we use sets of inflected forms, primarily from a high-resource language, to induce vector representations for individual UniMorph tags. Morphological tagging is performed by matching vector representations to embeddings for individual tokens. While our system results are dramatically below the average system submitted for the shared task evaluation campaign, our method is (we suspect) unique in its minimal reliance on labeled training data.
机译:本文介绍了Sigmorphon 2019年共享任务2的Unt Hilt + Ling系统:语境中的形态分析和lemmatization。我们的核心方法侧重于形态学标记任务;词语份额标记和lemmatization被视为辅助任务。鉴于任务的高度多语言性质,我们提出了一种方法,这使得提供的培训数据最少使用,以便在没有标记的形态分析任务的培训数据的情况下可以扩展语言。具体来说,我们使用并行圣经语料库来对齐verse水平的上下文嵌入。对齐的经验用于构建跨语言转换矩阵,其又用于映射嵌入空间以获得各种语言。最后,我们使用主要来自高资源语言的流动形式集,诱导个体unimorph标签的矢量表示。通过将载体表示来执行形态标记,以便为单个令牌嵌入嵌入。虽然我们的系统结果显着低于为共享任务评估活动提交的平均系统,但我们的方法是(我们怀疑)在最小的依赖标记训练数据的依赖中是独一无二的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号