首页> 外文会议>International conference on computational linguistics >Enhancing Lemmatization for Mongolian and its Application to Statistical Machine Translation
【24h】

Enhancing Lemmatization for Mongolian and its Application to Statistical Machine Translation

机译:加强蒙古族的lemmatization及其在统计机器翻译中的应用

获取原文

摘要

Lemmatization is crucial in natural language processing and information retrieval especially for highly inflected languages, such as Finnish and Mongolian. The state-of-the-art method of lemmatization for Mongolian does not need a noun dictionary and is scalable, but errors of this method are mainly caused by problems related to part of speech (POS) information. To resolve this problem, we integrate POS tagging and lemmatization for Mongolian. We evaluate the effectiveness of our method and its contribution to statistical machine translation.
机译:lemmatization在自然语言处理和信息检索中至关重要,特别是对于芬兰和蒙古等高度变化的语言。蒙古语的最先进的lemmatization方法不需要名词字典并且是可扩展的,但这种方法的错误主要是由与部分语音(POS)信息相关的问题引起的。为了解决这个问题,我们将POS标记和偏执留在蒙古语中整合。我们评估我们对统计机器翻译的方法的有效性及其贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号