首页> 美国卫生研究院文献>Genome Research >Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly
【2h】

Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly

机译:对GRCh38和从头单倍体基因组装配的评估证明了参考装配的持久质量

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health.
机译:人类参考基因组装配在当今的基础和临床研究的几乎所有方面都发挥着核心作用。 GRCh38是自2009年以来的第一个坐标更改装配更新;它反映了大约1000个问题的解决方案,涵盖了从数千个单碱基更改到兆碱基规模的路径重组,空位闭合以及以前孤立序列的本地化的各种修改。我们开发了一种针对目标碱基更新的序列生成新方法,并使用了来自新基因组作图技术和单一单倍型资源的数据来识别和解决较大的装配问题。引用程序集首次包含着丝粒的基于序列的表示形式。我们还扩展了备用基因座的数量,以创建一个参考,以更可靠地表示人口变化。我们证明了这些更新使参考文献成为了一种改进的注释底物​​,改变了未改变区域的阅读序列,并影响了临床相关基因座的变异解释。我们还评估了一组新的从头阅读的单倍体组装,并得出结论,尽管就连续性,错误率和基因完整性而言,新组装与参考文献相比具有优势,但该参考文献仍能为复杂的基因组区域提供最佳的表征。编码序列。我们断言GRCh38中收集的更新使新的装配体成为进行全面分析的更强大的基础,这将增进我们对人类生物学的了解并推动我们改善健康的努力。

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号