...
首页> 外文期刊>Journal of Theoretical Biology >Mining susceptibility gene modules and disease risk genes from SNP data by combining network topological properties with support vector regression
【24h】

Mining susceptibility gene modules and disease risk genes from SNP data by combining network topological properties with support vector regression

机译:通过将网络拓扑特性与支持向量回归相结合,从SNP数据中挖掘易感基因模块和疾病风险基因

获取原文
获取原文并翻译 | 示例
           

摘要

Genome-wide association study is a powerful approach to identify disease risk loci. However, the molecular regulatory mechanisms for most complex diseases are still not well understood. Therefore, further investigating the interplay between genetic factors and biological networks is important for elucidating the molecular mechanisms of complex diseases. Here, we proposed a novel framework to identify susceptibility gene modules and disease risk genes by combining network topological properties with support vector regression from single nucleotide polymorphism (SNP) level. We assigned risk SNPs to genes using the University of California at Santa Cruz (UCSC) genome database, and then mapped these genes to protein-protein interaction (PPI) networks. The gene modules implicated by hub genes were extracted using the PPI networks and the topological property was analyzed for these gene modules. For each gene module, risk feature genes were determined by topological property analysis and support vector regression. As a result, five shared risk feature genes, CD80, EGFR, FN1, GSK3B and TRAF6 were found and proven to be associated with rheumatoid arthritis by previous reports. Our approach showed a good performance in comparison with other approaches and can be used for prioritizing candidate genes associated with complex diseases.
机译:全基因组关联研究是确定疾病风险基因座的有效方法。然而,对于大多数复杂疾病的分子调控机制仍不十分了解。因此,进一步研究遗传因素和生物网络之间的相互作用对于阐明复杂疾病的分子机制很重要。在这里,我们提出了一个新颖的框架,通过将网络拓扑特性与支持向量从单核苷酸多态性(SNP)水平回归而结合起来,来识别易感基因模块和疾病风险基因。我们使用加利福尼亚大学圣克鲁斯分校(UCSC)基因组数据库为基因分配了风险SNP,然后将这些基因映射到蛋白质-蛋白质相互作用(PPI)网络。使用PPI网络提取由集线器基因牵连的基因模块,并分析这些基因模块的拓扑特性。对于每个基因模块,通过拓扑特性分析和支持向量回归确定风险特征基因。结果,发现了五个共有的风险特征基因,即CD80,EGFR,FN1,GSK3B和TRAF6,并被先前的报道证明与类风湿关节炎有关。与其他方法相比,我们的方法显示出良好的性能,可用于对与复杂疾病相关的候选基因进行优先排序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号