【24h】

The DIEGO Lab Graph Based Gene Normalization System

机译:基于DIEGO Lab Graph的基因标准化系统

获取原文

摘要

Gene entity normalization, the mapping of a gene mention in free text to a unique identifier, is one of the primary subtasks in the biomedical information extraction pipeline. Gene entity normalization provides many challenges, specifically with the high ambiguity of gene names and the many-to-many relationship between gene names and identifiers. Drawing inspiration from recent work in word sense disambiguation, this paper presents a gene entity normalization system based on entity relationship graphs. This system creates a concept graph from the possible entities and their relationships within a full-text document, and takes advantage of a node ranking algorithm to rank and score each potential candidate entity. This system is a prototype to represent a specific approach to gene normalization, and the results reflect this. However, this system demonstrates that the relationship graph-based approach, an approach grounded in a theoretical basis, can potentially be useful for gene normalization and possibly for the normalization of various biomedical entities.
机译:基因实体归一化,即在自由文本中提及的基因到唯一标识符的映射,是生物医学信息提取管道中的主要子任务之一。基因实体规范化带来了许多挑战,特别是基因名称的歧义性以及基因名称与标识符之间的多对多关系。借鉴最近在词义歧义研究中的启发,提出了一种基于实体关系图的基因实体归一化系统。该系统根据全文文档中可能的实体及其关系创建概念图,并利用节点排名算法对每个潜在的候选实体进行排名和评分。该系统是一个原型,代表了一种特定的基因标准化方法,结果反映了这一点。但是,该系统证明基于关系图的方法(一种基于理论的方法)可能对基因标准化以及对各种生物医学实体的标准化很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号