首页> 外文会议>PRICAI 2010: Trends in artificial intelligence >Chinese Named Entity Recognition Based on Hierarchical Hybrid Model
【24h】

Chinese Named Entity Recognition Based on Hierarchical Hybrid Model

机译:基于层次混合模型的中文命名实体识别

获取原文
获取原文并翻译 | 示例

摘要

Chinese named entity recognition is a challenging, difficult, yet important task in natural language processing. This paper presents a novel approach based on a hierarchical hybrid model to recognize Chinese named entities. Three mutually dependent stages- boosting, Markov Logic Networks (MLNs) based recognition, and abbreviation detection - are integrated in the model. AdaBoost algorithm is utilized for fast recognition of simple named entities first. More complex named entities are then piped into MLNs for accurate recognition. In particular, the left boundary recognition of named entities is considered. Lastly, special care is taken for classifying the abbreviated named entities by using the global context information in the same document. Experiments were conducted on People's Daily corpus. The results show that our approach can improve the performance significantly with precision of 94.38%, recall of 93.89%, and F_(β=1) value of 93.97%.
机译:在自然语言处理中,中文命名实体识别是一项具有挑战性,困难但重要的任务。本文提出了一种基于层次混合模型的新方法来识别中文命名实体。该模型集成了三个相互依赖的阶段-增强,基于马尔可夫逻辑网络(MLN)的识别和缩写检测。 AdaBoost算法首先用于快速识别简单的命名实体。然后,将更复杂的命名实体传递到MLN中以进行准确识别。特别是考虑了命名实体的左边界识别。最后,要特别注意通过使用同一文档中的全局上下文信息对缩写的命名实体进行分类。在《人民日报》语料库上进行了实验。结果表明,我们的方法可以显着提高性能,精度为94.38%,召回率为93.89%,F_(β= 1)值为93.97%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号