Conference: International Conference on Computational Linguistics

Mongolian Named Entity Recognition System with Rich Features

Abstract

In this paper, we first build a manually annotated named entity corpus of Mongolian. We then propose three morphological processing methods and study a comprehensive set of features for Mongolian named entity recognition, including syllable, lexical, context, morphological, and semantic features. We also evaluate the influence of word cluster features on the system and finally combine all features. The experimental results show that segmenting each suffix into an individual token achieves better results than deleting suffixes or using the suffixes as features. The system based on suffix segmentation with all proposed features yields a benchmark F-measure of 84.65 on this corpus.
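The three morphological processing options compared in the abstract (deleting suffixes, using suffixes as features, and segmenting each suffix into its own token) can be illustrated with a minimal sketch. This is not the authors' implementation: the `+` separator, the placeholder words, the function names, and the dictionary keys are all assumptions made for illustration, and a real system would rely on a Mongolian morphological analyzer to find the stem and suffix boundaries.

```python
from typing import Dict, List, Tuple

# Hypothetical marker between a stem and each of its suffixes.
# The real corpus format is not given in the abstract.
SEP = "+"


def split_word(word: str) -> Tuple[str, List[str]]:
    """Split 'stem+suf1+suf2' into ('stem', ['suf1', 'suf2'])."""
    stem, *suffixes = word.split(SEP)
    return stem, suffixes


def delete_suffixes(words: List[str]) -> List[Dict]:
    """Option 1: keep only the stem, one token per word."""
    return [{"token": split_word(w)[0]} for w in words]


def suffixes_as_features(words: List[str]) -> List[Dict]:
    """Option 2: one token per word, suffixes attached as a feature."""
    out = []
    for w in words:
        stem, suffixes = split_word(w)
        out.append({"token": stem, "suffixes": suffixes})
    return out


def segment_suffixes(words: List[str]) -> List[Dict]:
    """Option 3: the stem and every suffix become separate tokens."""
    out = []
    for w in words:
        stem, suffixes = split_word(w)
        out.append({"token": stem, "is_suffix": False})
        out.extend({"token": s, "is_suffix": True} for s in suffixes)
    return out


# Placeholder words, not real Mongolian data.
words = ["STEM1+SUF_A", "STEM2", "STEM3+SUF_B+SUF_C"]

for name, fn in [
    ("delete", delete_suffixes),
    ("feature", suffixes_as_features),
    ("segment", segment_suffixes),
]:
    print(name, fn(words))
```

Under the third option the sequence labeler sees a longer token sequence in which each suffix receives its own label, which is the configuration the abstract reports as performing best.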
