Apparatus and method for recognizing biological named entity from biological literature based on UMLS

Abstract

The present invention relates to an apparatus and method for recognizing biological named entities in biological literature based on the Unified Medical Language System (UMLS). The apparatus and method receive the Metathesaurus from the UMLS and construct a concept name database, a single name database, and a category keyterm database, which are the language resources used to recognize named entities. They then receive each concept name stored in the concept name database, extract features of each concept name using the data stored in the single name database and the category keyterm database, and construct a rule database by creating rules for recognizing named entities and filtering those rules using the extracted features. Finally, they receive a biological document, extract the nouns and noun phrases that are candidate named entities, apply the rules stored in the rule database to those nouns and noun phrases, and recognize the named entities. In this way, the invention can effectively extract biological named entities, which can serve as important individual items of information in the input literature.
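The pipeline described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the toy `METATHESAURUS` rows, the choice of the last word of a concept name as its category keyterm, the feature labels (`KEY:…`, `ALNUM`, `NAME`, `WORD`), the support-based rule filter, and the use of plain token n-grams in place of real noun-phrase extraction, which would normally need a part-of-speech tagger.

```python
import re
from collections import Counter, defaultdict

# Toy stand-in for Metathesaurus rows: (concept name, category). Invented data.
METATHESAURUS = [
    ("interleukin 2 receptor", "protein"),
    ("tumor necrosis factor", "protein"),
    ("p53 gene", "gene"),
    ("brca1 gene", "gene"),
]

def build_resources(rows):
    """Build the three language resources named in the abstract."""
    concept_names = dict(rows)
    single_names = {tok for name, _ in rows for tok in name.split()}
    heads = defaultdict(Counter)
    for name, category in rows:
        heads[name.split()[-1]][category] += 1
    # Assumption: a category keyterm is the head (last) word of a concept name.
    keyterms = {head: cats.most_common(1)[0][0] for head, cats in heads.items()}
    return concept_names, single_names, keyterms

def features(tokens, single_names, keyterms):
    """Map each token to a coarse feature label (hypothetical label set)."""
    feats = []
    for tok in tokens:
        if tok in keyterms:
            feats.append("KEY:" + keyterms[tok])
        elif any(ch.isdigit() for ch in tok):
            feats.append("ALNUM")
        elif tok in single_names:
            feats.append("NAME")
        else:
            feats.append("WORD")
    return tuple(feats)

def build_rules(resources, min_support=1):
    """Create one rule per feature pattern, then filter the rules."""
    concept_names, single_names, keyterms = resources
    counts = defaultdict(Counter)
    for name, category in concept_names.items():
        counts[features(name.split(), single_names, keyterms)][category] += 1
    rules = {}
    for pattern, cats in counts.items():
        # Filter: keep unambiguous patterns seen at least min_support times.
        if len(cats) == 1 and sum(cats.values()) >= min_support:
            rules[pattern] = next(iter(cats))
    return rules

def recognize(text, resources, rules, max_len=4):
    """Greedy longest-match of rules over token n-grams (noun-phrase stand-in)."""
    _, single_names, keyterms = resources
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    found, i = [], 0
    while i < len(tokens):
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            span = tokens[i:i + n]
            category = rules.get(features(span, single_names, keyterms))
            if category is not None:
                found.append((" ".join(span), category))
                i += n
                break
        else:
            i += 1  # no rule matched at this position
    return found
```

In this sketch a rule learned from "p53 gene" and "brca1 gene" (an alphanumeric token followed by the keyterm "gene") also matches an unseen name such as "TP63 gene", which is the kind of generalization a feature-based rule database is meant to provide. Raising `min_support` drops rules backed by only one concept name, a simple stand-in for the rule-filtering step.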
