首页> 外文学位 >Between genes and phenotypes: An integrative network-based Monte Carlo method for the prediction of human gene-phenotype associations.
【24h】

Between genes and phenotypes: An integrative network-based Monte Carlo method for the prediction of human gene-phenotype associations.

机译:在基因和表型之间:基于集成网络的蒙特卡洛方法,用于预测人类基因-表型的关联。

获取原文
获取原文并翻译 | 示例

摘要

Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms.;We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules.;We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database.
机译:复杂的人类疾病通常是由多种突变引起的,每种突变对疾病表型的影响都很小。为了研究这些复杂表型的基础,我们开发了一种基于网络的方法来识别在特定表型中特别激活的共表达模块。我们整合了这些模块,蛋白质-蛋白质相互作用数据,基因本体论注释以及我们从文献衍生的基因-表型关联数据库,以预测新型人类基因-表型关联。我们的系统化预测为我们提供了对人类基因多效性及其潜在调控机制进行全局分析的机会。我们将该方法应用于338个微阵列数据集,涵盖178个表型类别,并确定了193,145个表型特异性共表达模块。我们为每种表型训练了随机森林分类器,并预测了总共6,558个基因-表型关联。我们显示40.9%的基因是多效性的,这说明多效性比以前预期的更为普遍。我们收集了77个ChIP芯片数据集,研究了69个在不同表型条件下结合16,000个靶标的转录因子。利用这一独特的数据源,我们证实了动态转录调控是驱动表型特异性基因模块形成的重要力量。;我们创建了一个全基因组的基因到表型作图,具有许多潜在的意义,包括提供潜在的新药物靶标和发现人类疾病表型的基础。我们对这些表型特异性共表达模块的分析揭示了基因多效性的高患病率,并表明表型特异性转录因子的结合可能有助于表型多样性。我们研究的所有资源均可在我们的在线表型预测数据库上免费获得。

著录项

  • 作者

    R. Mehan, Michael.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 146 p.
  • 总页数 146
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:38:23

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号