...
首页> 外文期刊>Genome research >Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly
【24h】

Efficient and unique cobarcoding of second-generation sequencing reads from long DNA molecules enabling cost-effective and accurate sequencing, haplotyping, and de novo assembly

机译:高DNA分子的高效且独特的二代测序读取读取,从而实现具有成本效益和准确的测序,单倍型和De Novo组装

获取原文
获取原文并翻译 | 示例
           

摘要

Here, we describe single-tube long fragment read (stLFR), a technology that enables sequencing of data from long DNA molecules using economical second-generation sequencing technology. It is based on adding the same barcode sequence to subfragments of the original long DNA molecule (DNA cobarcoding). To achieve this efficiently, stLFR uses the surface of microbeads to create millions of miniaturized barcoding reactions in a single tube. Using a combinatorial process, up to 3.6 billion unique barcode sequences were generated on beads, enabling practically nonredundant cobarcoding with 50 million barcodes per sample. Using stLFR, we demonstrate efficient unique cobarcoding of more than 8 million 20- to 300-kb genomic DNA fragments. Analysis of the human genome NA12878 with stLFR demonstrated high-quality variant calling and phase block lengths up to N50 34 Mb. We also demonstrate detection of complex structural variants and complete diploid de novo assembly of NA12878. These analyses were all performed using single stLFR libraries, and their construction did not significantly add to the time or cost of whole-genome sequencing (WGS) library preparation. stLFR represents an easily automatable solution that enables high-quality sequencing, phasing, SV detection, scaffolding, cost-effective diploid de novo genome assembly, and other long DNA sequencing applications.
机译:在这里,我们描述单管长片段读取(STLFR),一种技术,其能够使用经济的第二代排序技术从LONG DNA分子中排序数据。它基于将相同的条形码序列添加到原始的Long DNA分子(DNA Cobarcoding)的子帧中。为了有效地实现这一目标,STLFR使用微珠的表面来在单个管中产生数百万小型化的条形码反应。使用组合过程,在珠子上产生高达36亿个独特的条形码序列,实际上是每种样本的50百万条形码的非冗余的COBARCODING。使用STLFR,我们展示了超过800万至300 kB基因组DNA片段的高效独特的Cobarcoding。用STLFR分析人类基因组NA12878,STLFR展示了高质量的变体呼叫和相块长达N50 34 MB的相位块。我们还证明了检测到复杂的结构变体和NA12878的完整二倍体De Novo集会。这些分析全部使用单一的STLFR文库进行,并且它们的结构没有显着增加全基因组测序(WGS)文库制备的时间或成本。 STLFR表示易于自动的解决方案,可实现高质量排序,分阶段,SV检测,脚手架,经济高效的二倍体DE Novo基因组组装和其他长DNA测序应用。

著录项

  • 来源
    《Genome research》 |2019年第5期|共11页
  • 作者单位

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen MGI Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen MGI Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Complete Genom Inc Adv Genom Technol Lab San Jose CA 95134 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 医学遗传学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号