...
首页> 外文期刊>BMC Medical Genomics >Integrating human omics data to prioritize candidate genes
【24h】

Integrating human omics data to prioritize candidate genes

机译:整合人类组学数据以优先考虑候选基因

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. Methods With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. Results We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. Conclusion The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
机译:背景技术与人类复杂疾病有关的基因的鉴定仍然是计算系统生物学中的巨大挑战。尽管已经开发出使用疾病表型相似性和蛋白质-蛋白质相互作用网络来确定候选基因优先级的方法,但是在这些方法中,其他有价值的组学数据来源却被大大忽略了。方法基于这种理解,我们提出了一种称为BRIDGE的方法,该方法通过将疾病的表型相似性与蛋白质组相互作用,基因序列相似性,基因表达模式,基因本体注释和基因途径成员等组学数据相集成来对候选基因进行优先排序。 BRIDGE利用带有套索罚分的多元回归模型来自动加权不同的数据源,并能够发现与遗传基础完全未知的疾病相关的基因。结果我们进行了大规模的交叉验证实验,结果表明,超过60%的已知疾病基因在模拟的连锁间隔中可以被BRIDGE排名第一,表明该方法的优越性。通过应用BRIDGE预测肥胖和II型糖尿病的新基因和转录网络,我们进一步进行了两个全面的案例研究。结论所提出的方法为整合多组学数据以推断疾病基因提供了一种有效且可扩展的方法。 BRIDGE的进一步应用将有益于提供新型疾病基因和人类疾病的潜在机制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号