...
首页> 外文期刊>The American Journal of Human Genetics >A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies
【24h】

A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies

机译:来自全基因组排序研究中的Novo突变的统计框架映射来自Novo突变

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Analysis of de novo mutations (DNMs) from sequencing data of nuclear families has identified risk genes for many complex diseases, including multiple neurodevelopmental and psychiatric disorders. Most of these efforts have focused on mutations in protein-coding sequences. Evidence from genome-wide association studies (GWASs) strongly suggests that variants important to human diseases often lie in non-coding regions. Extending DNM-based approaches to non-coding sequences is challenging, however, because the functional significance of non-coding mutations is difficult to predict. We propose a statistical framework for analyzing DNMs from whole-genome sequencing (WGS) data. This method, TADA-Annotations (TADA-A), is a major advance of the TADA method we developed earlier for DNM analysis in coding regions. TADA-A is able to incorporate many functional annotations such as conservation and enhancer marks, to learn from data which annotations are informative of pathogenic mutations, and to combine both coding and non-coding mutations at the gene level to detect risk genes. It also supports meta-analysis of multiple DNM studies, while adjusting for study-specific technical effects. We applied TADA-A to WGS data of similar to 300 autism-affected family trios across five studies and discovered several autism risk genes. The software is freely available for all research uses.
机译:从核家族的测序数据分析De Novo突变(DNMS)已经确定了许多复杂疾病的风险基因,包括多种神经发育和精神病疾病。这些努力中的大部分都集中在蛋白质编码序列中的突变。来自基因组关联研究(GWASS)的证据强烈表明对人类疾病重要的变体通常位于非编码区域。然而,扩展基于DNM的非编码序列的方法是具有挑战性的,因为非编码突变的功能意义难以预测。我们提出了一种统计框架,用于分析来自全基因组测序(WGS)数据的DNMS。这种方法,TADA - 注释(TADA-A),是我们在编码区中提前开发的TADA方法的主要进展。 TADA-A能够纳入许多功能注释,例如保护和增强子标记,以学习注释是致病性突变的信息,并将编码和非编码突变结合在基因水​​平上以检测风险基因。它还支持多个DNM研究的Meta分析,同时调整学习特定的技术效果。我们将TADA-A应用于类似于300项自闭症影响的家庭三项研究的WGS数据,并发现了几种自闭症风险基因。软件可用于所有研究用途。

著录项

  • 来源
  • 作者单位

    Univ Chicago Dept Human Genet Chicago IL 60637 USA;

    Carnegie Mellon Univ Sch Comp Sci Computat Biol Dept Pittsburgh PA 15123 USA;

    Carnegie Mellon Univ Sch Comp Sci Computat Biol Dept Pittsburgh PA 15123 USA;

    Wenzhou Med Univ Inst Genom Med Wenzhou 325000 Zhejiang Peoples R China;

    Cent S Univ Ctr Med Genet Sch Life Sci Changsha 410078 Hunan Peoples R China;

    Yale Med Child Study Ctr New Haven CT 06520 USA;

    Yale Sch Med Kavli Inst Neurosci New Haven CT 06520 USA;

    Chinese Acad Sci Beijing Inst Life Sci Beijing 100000 Peoples R China;

    Chinese Acad Sci Beijing Inst Life Sci Beijing 100000 Peoples R China;

    Univ Chicago Comm Genet Genom &

    Syst Biol Chicago IL 60637 USA;

    Univ Chicago Dept Human Genet Chicago IL 60637 USA;

    Univ Chicago Dept Human Genet Chicago IL 60637 USA;

    Wenzhou Med Univ Inst Genom Med Wenzhou 325000 Zhejiang Peoples R China;

    Yale Sch Med Dept Genet New Haven CT 06520 USA;

    Columbia Univ Dept Biostat New York NY 10027 USA;

    Chinese Acad Sci Beijing Inst Life Sci Beijing 100000 Peoples R China;

    Cent S Univ Ctr Med Genet Sch Life Sci Changsha 410078 Hunan Peoples R China;

    Yale Sch Med Dept Genet New Haven CT 06520 USA;

    Chinese Acad Sci Beijing Inst Life Sci Beijing 100000 Peoples R China;

    Univ Chicago Dept Human Genet Chicago IL 60637 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 医学遗传学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号