首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >Research on Entity Recognition Based on Multi-criteria Fusion Model
【24h】

Research on Entity Recognition Based on Multi-criteria Fusion Model

机译:基于多标准融合模型的实体识别研究

获取原文

摘要

In order to fully and comprehensively utilize the Chinese named entity recognition corpus marked according to different labeling criteria, this paper, by applying the word-based BERT bidirectional language model, introducing multi-criteria shared connection layer and conditional random field (CRF) system, and using Microsoft Research Asia MSRA-NER corpus and Peking University’s People’s Daily part-of-speech tagging corpus (RMRB-98-1), firstly formed each Chinese named entity recognition model separately, and then mixed a multi-criteria fusion model with two corpora. Experiments show that the recognition effect of multi-criteria fusion model is better than that of each corpus independent model, reaching 94.46% F1 value and 94.32% F1 value respectively on MSRA-NER and RMRB-98-1 corpus. However, the experiment still has its limit in the scale of the corpus involved in the fusion. Later, integration of more corpora, and combination with entity recognition tasks in specific fields such as biology and military will be further explored to enhance the recognition effect.
机译:为了完全和全面地利用根据不同标签标准标记的中文名称实体识别语料库,本文通过应用基于Word的BERT双向语言模型,引入多标准共享连接层和条件随机字段(CRF)系统,并使用Microsoft Research Asia Msra-Ner语料库和北京大学的人民每日演讲标记语料库(RMRB-98-1),首先形成了每个名称的实体识别模型,然后用两个混合多标准融合模型Corpora。实验表明,多标准融合模型的识别效果优于每个语料库独立模型的识别效果,分别达到MSRA-ner和RMRB-98-1语料库的94.46%F1值和94.32%F1值。然而,实验仍然是融合中涉及的语料库的限制。后来,将进一步探索更多GRACKA的整合以及与特定领域的实体识别任务组合,以提高识别效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号