首页> 外文会议>International Work-Conference on Bioinformatics and Biomedical Engineering >Prioritization of Schizophrenia Risk Genes by a Network-Regularized Logistic Regression Method
【24h】

Prioritization of Schizophrenia Risk Genes by a Network-Regularized Logistic Regression Method

机译:通过网络正规的逻辑回归方法进行精神分裂症风险基因的优先次序

获取原文

摘要

Schizophrenia (SCZ) is a severe mental disorder with a large genetic component. While recent large-scale microarray- and sequencing-based genome wide association studies have made significant progress toward finding SCZ risk variants and genes of subtle effect, the interactions among them were not considered in those studies. Using a protein-protein interaction network both in our regression model and to generate a SCZ gene subnetwork, we developed an analytical framework with Logit-Lapnet, the graphical Laplacian-regularized logistic regression, for whole exome sequencing (WES) data analysis to detect SCZ gene subnetworks. Using simulated data from sequencing-based association study, we compared the performances of Logit-Lapnet with other logistic regression (LR)-based models. We use Logit-Lapnet to prioritize genes according to their coefficients and select top-ranked genes as seeds to generate the gene sub-network that is associated to SCZ. The comparison demonstrated not only the applicability but also better performance of Logit-Lapnet to score disease risk genes using sequencing-based association data. We applied our method to SCZ whole exome sequencing data and selected top-ranked risk genes, the majority of which are either known SCZ genes or genes potentially associated with SCZ. We then used the seed genes to construct SCZ gene subnetworks. This result demonstrates that by ranking gene according to their disease contributions our method scores and thus prioritizes disease risk genes for further investigation. An implementation of our approach in MATLAB is freely available for download at: http://zdzlab.einstein.yu.edu/1/publications/LapNet-MATLAB.zip.
机译:精神分裂症(SCZ)是一种严重的精神障碍,具有大型遗传组分。虽然近期大规模的微阵列和排序基因组宽协会研究对寻找SCZ风险变异和微妙效果的基因进行了重大进展,但在这些研究中不考虑它们之间的相互作用。在我们的回归模型中使用蛋白质 - 蛋白质相互作用网络并生成SCZ基因子网络,我们开发了一种与LogIT-Lapnet的分析框架,图形Laplacian-Rengeyized Logistic回归,用于全外壳测序(WES)数据分析来检测SCZ基因子网。使用从基于排序的关联研究中的模拟数据,与其他逻辑回归(LR)的模型进行了比较了LogIT-Lapnet的性能。我们使用Logit-LapNet根据其系数优先考虑基因,并选择排名基因作为种子以生成与SCZ相关的基因子网。比较不仅证明了使用基于测序的关联数据的LogIT-Lapnet的适用性而且更好地表现了Logit-Lapnet的性能。我们将我们的方法应用于SCZ整体exome测序数据和选择的排名越野风险基因,其中大多数是已知的SCZ基因或可能与SCZ相关的基因。然后我们使用种子基因来构建SCZ基因子网。该结果表明,通过根据疾病的贡献排序基因,我们的方法评分并因此优先考虑疾病风险基因进行进一步调查。我们在Matlab中的方法可以自由地下载:http://zdzlab.einstein.yu.edu/1/1/publications/lapnet-matlab.zip。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号