首页> 外文会议>IEEE International Conference on e-Science >Generating knowledge networks from phenotypic descriptions
【24h】

Generating knowledge networks from phenotypic descriptions

机译:从表型描述生成知识网络

获取原文

摘要

Several computing systems rely on information about living beings, such as Identification Keys — artifacts created by biologists to identify specimens following a flow of questions about their observable characters (phenotype). These questions are described in a free-text format, e.g., “big and black eye”. Free-texts hamper the automatic information interpretation by machines, limiting their ability to perform search and comparison of terms, as well as integration tasks. This paper proposes a method to extract phenotypic information from natural language texts from biology legacy information systems, transforming them in an Entity-Quality formalism — a format to represent each phenotype character (Entity) and its state (Quality). Our approach aligns automatically recognized. Entities and Qualities with domain concepts described in ontologies. It adopts existing Natural Language Processing techniques, adding an extra original step, which exploits intrinsic characteristics of phenotypic descriptions and of the organizational structure of Identification Keys. The approach was validated over the FishBase data. We conducted extensive experiments based on a manually annotated Gold Standard set to assess the precision and applicability of the proposed extraction method. The obtained results reveal the feasibility of our technique, its benefits and possibilities of scientific studies using the extracted knowledge network.
机译:几个计算系统依赖有关生物的信息,例如由生物学家创建的识别键 - 识别关于其可观察性角色(表型)的问题流动之后的标本。这些问题以自由文本格式描述,例如,“大黑眼睛”。自由文本妨碍机器的自动信息解释,限制了他们执行术语的搜索和比较的能力,以及集成任务。本文提出了一种从生物学传统信息系统中从自然语言文本中提取表型信息的方法,将它们变换为实体 - 质量形式主义 - 一种表示每个表型字符(实体)及其状态(质量)的格式。我们的方法对齐自动识别。在本体中描述的域概念的实体和品质。它采用现有的自然语言处理技术,增加了额外的原始步骤,该步骤利用了表型描述的内在特征和识别键的组织结构。该方法在鱼贝数据上验证。我们根据手动注释的金标准进行了广泛的实验,以评估所提出的提取方法的精确和适用性。所获得的结果揭示了我们技术的可行性,利用提取的知识网络的科学研究的益处和可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号