Conference: Nordic Conference of Computational Linguistics

Morphosyntactic Disambiguation in an Endangered Language Setting


Abstract

Endangered Uralic languages have a rich variety of inflectional forms in their morphology. This produces many homonymous inflected forms, which introduces considerable morphological ambiguity in sentences. Previous research has employed constraint grammars (CGs) to address this problem; however, CGs are often unable to fully disambiguate a sentence, and their development is labour-intensive. We present an LSTM-based model for automatically ranking morphological readings of sentences based on their quality. This ranking can be used to evaluate existing CG disambiguators or to morphologically disambiguate sentences directly. Our approach works on a morphological abstraction and can be trained with a very small dataset.
