首页> 外文会议>International conference on asian language processing >Mongolian Named Entity Recognition using suffixes segmentation
【24h】

Mongolian Named Entity Recognition using suffixes segmentation

机译:蒙古语命名实体识别使用后缀细分

获取原文

摘要

Mongolian is an agglutinative language with the complex morphological structures. Building an accurate Named Entity Recognition (NER) system for Mongolian is a challenging and meaningful work. This paper analyzes the characteristic of Mongolian suffixes using Narrow Non-Break Space and investigates Mongolian NER system under three methods in the Condition Random Field framework. The experiment shows that segmenting each suffix into an individual token achieves the best performance than both without segmenting and using the suffixes as a feature. Our approach obtains an F-measure = 82.71. It is appropriate for the Mongolian large scale vocabulary NER. This research also makes sense to other agglutinative languages NER systems.
机译:蒙古族是一种凝聚语言,具有复杂的形态结构。为蒙古建立一个准确的名为实体识别(NER)系统是一个具有挑战性和有意义的工作。本文分析了狭窄的非断裂空间蒙古后缀的特点,并在条件随机现场框架中调查了三种方法下的蒙古族系统。实验表明,将每个后缀分段为单个令牌,而不是在没有分割和使用后缀作为特征的情况下实现的最佳性能。我们的方法获得了F-Measol = 82.71。它适用于蒙古大规模词汇。这项研究还对其他凝聚性语言进行了意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号