首页> 外文会议>SIGMORPHON workshop on computational research in phonetics, phonology, and morphology;Annual meeting of the Association for Computational Linguistics >CUNI-Malta system at SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context: Operation-based word formation
【24h】

CUNI-Malta system at SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context: Operation-based word formation

机译:SIGMORPHON 2019上的CUNI-Malta系统在语境下的形态分析和词法化共享任务:基于操作的词形成

获取原文
获取外文期刊封面目录资料

摘要

This paper presents the submission by the Charles University-University of Malta team to the SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context. We present a lemmatization model based on previous work on neural transducers (Makarov and Clematide, 2018b; Aharoni and Goldberg, 2016). The key difference is that our model transforms the whole word form in every step, instead of consuming it character by character. We propose a merging strategy inspired by Byte-Pair-Encoding that reduces the space of valid operations by merging frequent adjacent operations. The resulting operations not only encode the actions to be performed but the relative position in the word token and how characters need to be transformed. Our morphological tagger is a vanilla biLSTM tagger that operates over operation representations, encoding operations and words in a hierarchical manner. Even though relative performance according to metrics is below the baseline, experiments show that our models capture important associations between interpretable operation labels and fine-grained morpho-syntax labels.
机译:本文介绍了马耳他查尔斯大学-大学团队向SIGMORPHON 2019语境中的形态分析和拔除共同任务提交的材料。我们基于先前在神经传感器上的工作提出了一个词消质化模型(Makarov和Clematide,2018b; Aharoni和Goldberg,2016)。关键区别在于我们的模型在每个步骤中都会转换整个单词的形式,而不是逐个字符地使用它。我们提出了一种基于字节对编码的合并策略,该策略通过合并频繁的相邻操作来减少有效操作的空间。产生的操作不仅对要执行的动作进行编码,而且对单词令牌中的相对位置以及如何转换字符进行编码。我们的形态标记器是一种普通的biLSTM标记器,可对操作表示进行操作,对操作和单词进行分层编码。即使根据指标的相对性能低于基线,实验也表明,我们的模型捕获了可解释的操作标签和细粒度的形态语法标签之间的重要关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号