首页> 外文期刊>BMC Bioinformatics >Integrative analysis and machine learning on cancer genomics data using the Cancer Systems Biology Database (CancerSysDB)
【24h】

Integrative analysis and machine learning on cancer genomics data using the Cancer Systems Biology Database (CancerSysDB)

机译:使用癌症系统生物学数据库(CancerSysDB)对癌症基因组学数据进行综合分析和机器学习

获取原文
           

摘要

Recent cancer genome studies on many human cancer types have relied on multiple molecular high-throughput technologies. Given the vast amount of data that has been generated, there are surprisingly few databases which facilitate access to these data and make them available for flexible analysis queries in the broad research community. If used in their entirety and provided at a high structural level, these data can be directed into constantly increasing databases which bear an enormous potential to serve as a basis for machine learning technologies with the goal to support research and healthcare with predictions of clinically relevant traits. We have developed the Cancer Systems Biology Database (CancerSysDB), a resource for highly flexible queries and analysis of cancer-related data across multiple data types and multiple studies. The CancerSysDB can be adopted by any center for the organization of their locally acquired data and its integration with publicly available data from multiple studies. A publicly available main instance of the CancerSysDB can be used to obtain highly flexible queries across multiple data types as shown by highly relevant use cases. In addition, we demonstrate how the CancerSysDB can be used for predictive cancer classification based on whole-exome data from 9091 patients in The Cancer Genome Atlas (TCGA) research network. Our database bears the potential to be used for large-scale integrative queries and predictive analytics of clinically relevant traits.
机译:最近对许多人类癌症类型的癌症基因组研究依赖于多种分子高通量技术。鉴于已生成大量数据,令人惊讶的是,很少有数据库可以促进对这些数据的访问,并使它们可用于广泛的研究社区中的灵活分析查询。如果将其整体使用并以较高的结构水平提供,则可以将这些数据定向到不断增加的数据库中,这些数据库具有巨大的潜力,可作为机器学习技术的基础,旨在通过对临床相关性状的预测来支持研究和医疗保健。我们已经开发了癌症系统生物学数据库(CancerSysDB),该资源用于高度灵活地查询和分析多种数据类型和多项研究中与癌症相关的数据。任何中心都可以采用CancerSysDB来组织其本地获取的数据,并将其与来自多个研究的公开可用数据集成。如高度相关的用例所示,CancerSysDB的公共可用主要实例可用于跨多种数据类型获取高度灵活的查询。此外,我们根据癌症基因组图谱(TCGA)研究网络中来自9091名患者的全外显子组数据,展示了如何将CancerSysDB用于预测癌症分类。我们的数据库具有用于大规模综合查询和临床相关性状预测分析的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号