首页> 外文OA文献 >Sequencing and de novo assembly of 150 genomes from Denmark as a population reference
【2h】

Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

机译:来自丹麦的150个基因组的测序和从头组装作为群体参考

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.
机译:现在正在对数十万个人类基因组进行测序,以表征遗传变异并使用此信息来增强对复杂疾病和其他表型性状的关联作图研究。遗传变异主要通过将短读图映射到参考基因组或通过进行局部装配来鉴定。但是,这些方法偏向于发现结构变异和基因组更复杂部分的变异。因此,需要大规模的从头组装。在这里,我们表明可以通过覆盖范围高达20千碱基对的伴侣对文库,从高覆盖率的测序中构建出色的从头组装。我们报告了来自GenomeDenmark项目的150个参与者(50个三重奏)的从头汇编。这些组件的质量类似于使用更昂贵的长读技术获得的组件。我们使用这些程序集来识别一组丰富的结构变异,包括许多新颖的插入物,并演示此变异目录如何实现对已知关联映射信号的进一步解密。我们利用程序集提供100种完全解析的主要组织相容性复杂单倍型并解析Y染色体的主要部分。我们的研究提供了一个区域参考基因组,我们期望该基因组将改善未来的关联图谱研究的能力,从而为精密医学计划铺平道路,目前在包括丹麦在内的许多国家已经启动了精确医学计划。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号