首页> 外文期刊>The American Journal of Human Genetics >Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates
【24h】

Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates

机译:人口规模测序数据可实现Y-STR突变率的精确估算

获取原文
获取原文并翻译 | 示例
           

摘要

Short tandem repeats (STRs) are mutation-prone loci that span nearly 1% of the human genome. Previous studies have estimated the mutation rates of highly polymorphic STRs by using capillary electrophoresis and pedigree-based designs. Although this work has provided insights into the mutational dynamics of highly mutable STRs, the mutation rates of most others remain unknown. Here, we harnessed whole-genome sequencing data to estimate the mutation rates of Y chromosome STRs (Y-STRs) with 2-6 bp repeat units that are accessible to Illumina sequencing. We genotyped 4,500 Y-STRs by using data from the 1000 Genomes Project and the Simons Genome Diversity Project. Next, we developed MUTEA, an algorithm that infers STR mutation rates from population-scale data by using a high-resolution SNP-based phylogeny. After extensive intrinsic and extrinsic validations, we harnessed MUTEA to derive mutation-rate estimates for 702 polymorphic STRs by tracing each locus over 222,000 meioses, resulting in the largest collection of Y-STR mutation rates to date. Using our estimates, we identified determinants of STR mutation rates and built a model to predict rates for STRs across the genome. These predictions indicate that the load of de novo STR mutations is at least 75 mutations per generation, rivaling the load of all other known variant types. Finally, we identified Y-STRs with potential applications in forensics and genetic genealogy, assessed the ability to differentiate between the Y chromosomes of father-son pairs, and imputed Y-STR genotypes.
机译:短串联重复序列(STR)是易突变位点,占人类基因组的近1%。以前的研究已经通过使用毛细管电泳和基于谱系的设计估计了高度多态性STR的突变率。尽管这项工作提供了对高度可变的STR的突变动力学的见解,但大多数其他突变的突变率仍然未知。在这里,我们利用全基因组测序数据来估计具有2-6 bp重复单元的Y染色体STR(Y-STR)的突变率,这些序列可用于Illumina测序。我们使用来自1000个基因组计划和西蒙斯基因组多样性计划的数据对4,500个Y-STR进行了基因分型。接下来,我们开发了MUTEA,这是一种通过使用高分辨率的基于SNP的系统发育系统从人群规模数据中推断STR突变率的算法。经过广泛的内在和外在验证后,我们利用MUTEA通过追踪222,000个meios的每个基因座,得出702个多态性STR的突变率估计值,从而形成了迄今为止最大的Y-STR突变率集合。使用我们的估计,我们确定了STR突变率的决定因素,并建立了一个模型来预测整个基因组中STR的发生率。这些预测表明,从头开始的STR突变的每一代至少有75个突变,可与所有其他已知变异类型的载荷相媲美。最后,我们确定了Y-STR在法医学和遗传谱系中的潜在应用,评估了区分父子对的Y染色体和估算的Y-STR基因型的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号