Journal: BMC Bioinformatics

PhenoGO: an integrated resource for the multiscale mining of clinical and biological data

Abstract

As genome-scale experiments grow more complex, a highly computable, accurate, and comprehensive resource spanning multiple biological scales and viewpoints has become increasingly central. To meet this need, we have significantly extended the PhenoGO database with gene-disease specific annotations and included an additional ten species. This computationally derived resource is primarily intended to provide phenotypic context (cell type, tissue, organ, and disease) for mining existing associations between gene products and GO terms specified in the Gene Ontology databases. Automated natural language processing (BioMedLEE) and computational ontology (PhenOS) methods were used to derive these relationships from the literature, expanding the database with information from ten additional species to include over 600,000 phenotypic contexts spanning eleven species from five GO annotation databases. An evaluation of the mappings (n = 300) found precision (positive predictive value) of 85% and recall (sensitivity) of 76%. Phenotypes are encoded in general-purpose ontologies such as the Cell Ontology and the Unified Medical Language System, and in specialized ontologies such as the Mouse Anatomy and the Mammalian Phenotype Ontology. A web portal has also been developed, allowing advanced filtering and querying of the database as well as download of the entire dataset (http://www.phenogo.org).
