Journal: 中文信息学报 (Journal of Chinese Information Processing)

Extraction and Labeling of Knowledge-Entity Types for Professional Literature

Abstract

Labeling the types of knowledge entities is an important problem in the structured management of professional literature and in mining its knowledge structure. However, because knowledge entities are highly specialized and their types are diverse, traditional entity-extraction and labeling methods perform poorly on literature data. To address this problem, we explore the data and summarize several characteristics specific to knowledge-entity types. Based on these characteristics, we propose a method that combines an unsupervised step with a semi-supervised step: heuristic rules first extract candidate type labels and label part of the knowledge entities, and a multi-label weighted label propagation algorithm (LPA) then extends the labeling to all knowledge entities. Compared with traditional methods, this approach derives the most likely type labels from the data itself and labels knowledge entities without manual annotation. Experimental results show that the proposed method is flexible and better suited to type labeling of knowledge entities in professional literature.
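
The abstract does not give the propagation rule itself, so the following is only a minimal sketch of generic multi-label weighted label propagation, not the authors' formulation. It assumes an undirected weighted graph over entities, a set of seed entities whose label distributions came from some earlier rule-based step, and a synchronous update in which each unlabeled entity takes the weight-averaged label scores of its neighbors. The function name `propagate_labels`, the entity names, the edge weights, and the labels `method` and `dataset` are all hypothetical.

```python
from collections import defaultdict


def propagate_labels(edges, seeds, iterations=20):
    """Generic multi-label weighted label propagation (illustrative sketch).

    edges: dict mapping (u, v) -> positive weight; treated as undirected.
    seeds: dict mapping node -> {label: score}; seed nodes stay clamped.
    Returns a dict mapping node -> {label: score}; scores of each
    non-seed node are normalized to sum to 1.
    """
    # Build a symmetric adjacency map from the edge list.
    neighbors = defaultdict(dict)
    for (u, v), weight in edges.items():
        neighbors[u][v] = weight
        neighbors[v][u] = weight

    scores = {node: dict(dist) for node, dist in seeds.items()}
    for _ in range(iterations):
        updated = {node: dict(dist) for node, dist in seeds.items()}
        for node in neighbors:
            if node in seeds:
                continue  # seed labels are never overwritten
            # Accumulate neighbor label scores, weighted by edge weight.
            acc = defaultdict(float)
            for nb, weight in neighbors[node].items():
                for label, score in scores.get(nb, {}).items():
                    acc[label] += weight * score
            total = sum(acc.values())
            if total > 0:
                updated[node] = {lab: s / total for lab, s in acc.items()}
        scores = updated
    return scores


if __name__ == "__main__":
    # Hypothetical toy graph: weights might come from co-occurrence counts.
    toy_edges = {
        ("entity_a", "entity_b"): 2.0,
        ("entity_b", "entity_c"): 1.0,
    }
    # Hypothetical seeds, standing in for the output of a rule-based step.
    toy_seeds = {
        "entity_a": {"method": 1.0},
        "entity_c": {"dataset": 1.0},
    }
    for node, dist in sorted(propagate_labels(toy_edges, toy_seeds).items()):
        print(node, dist)
```

Each entity keeps a score for every label it receives, so an entity can end up with more than one type by keeping all labels whose score passes a chosen threshold. That threshold is a further assumption and is not specified in the abstract.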
