首页> 外文期刊>GigaScience >MinION? nanopore sequencing of environmental metagenomes: a synthetic approach
【24h】

MinION? nanopore sequencing of environmental metagenomes: a synthetic approach

机译:奴才?环境元基因组的纳米孔测序:一种合成方法

获取原文
           

摘要

Background: Environmental metagenomic analysis is typically accomplished by assigning taxonomy and/or function from whole genome sequencing or 16S amplicon sequences. Both of these approaches are limited, however, by read length, among other technical and biological factors. A nanopore-based sequencing platform, MinION?, produces reads that are ≥1 × 104 bp in length, potentially providing for more precise assignment, thereby alleviating some of the limitations inherent in determining metagenome composition from short reads. We tested the ability of sequence data produced by MinION (R7.3 flow cells) to correctly assign taxonomy in single bacterial species runs and in three types of low-complexity synthetic communities: a mixture of DNA using equal mass from four species, a community with one relatively rare (1%) and three abundant (33% each) components, and a mixture of genomic DNA from 20 bacterial strains of staggered representation. Taxonomic composition of the low-complexity communities was assessed by analyzing the MinION sequence data with three different bioinformatic approaches: Kraken, MG-RAST, and One Codex. Results: Long read sequences generated from libraries prepared from single strains using the version 5 kit and chemistry, run on the original MinION device, yielded as few as 224 to as many as 3497 bidirectional high-quality (2D) reads with an average overall study length of 6000 bp. For the single-strain analyses, assignment of reads to the correct genus by different methods ranged from 53.1% to 99.5%, assignment to the correct species ranged from 23.9% to 99.5%, and the majority of misassigned reads were to closely related organisms. A synthetic metagenome sequenced with the same setup yielded 714 high quality 2D reads of approximately 5500 bp that were up to 98% correctly assigned to the species level. Synthetic metagenome MinION libraries generated using version 6 kit and chemistry yielded from 899 to 3497 2D reads with lengths averaging 5700 bp with up to 98% assignment accuracy at the species level. The observed community proportions for “equal” and “rare” synthetic libraries were close to the known proportions, deviating from 0.1% to 10% across all tests. For a 20-species mock community with staggered contributions, a sequencing run detected all but 3 species (each included at 99% of reads were assigned to the correct family. Conclusions: At the current level of output and sequence quality (just under 4 × 103 2D reads for a synthetic metagenome), MinION sequencing followed by Kraken or One Codex analysis has the potential to provide rapid and accurate metagenomic analysis where the consortium is comprised of a limited number of taxa. Important considerations noted in this study included: high sensitivity of the MinION platform to the quality of input DNA, high variability of sequencing results across libraries and flow cells, and relatively small numbers of 2D reads per analysis limit. Together, these limited detection of very rare components of the microbial consortia, and would likely limit the utility of MinION for the sequencing of high-complexity metagenomic communities where thousands of taxa are expected. Furthermore, the limitations of the currently available data analysis tools suggest there is considerable room for improvement in the analytical approaches for the characterization of microbial communities using long reads. Nevertheless, the fact that the accurate taxonomic assignment of high-quality reads generated by MinION is approaching 99.5% and, in most cases, the inferred community structure mirrors the known proportions of a synthetic mixture warrants further exploration of practical application to environmental metagenomics as the platform continues to develop and improve. With further improvement in sequence throughput and error rate reduction, this platform shows great promise for precise real-time analysis of the composition and structure of more complex microbial communities.
机译:背景:环境宏基因组学分析通常是通过分配全基因组测序或16S扩增子序列的分类学和/或功能来完成的。但是,这两种方法都受到读取长度以及其他技术和生物学因素的限制。基于纳米孔的测序平台MinION?可产生长度≥1×10 4 bp的读数,可能提供更精确的分配,从而减轻了从短时间确定元基因组组成的固有局限性读。我们测试了MinION(R7.3流通池)产生的序列数据在单个细菌物种运行和三种类型的低复杂度合成群落中正确分配分类的能力:使用四种物种等质量的DNA混合物,一个群落具有一种相对稀有的成分(占1%)和三种丰富的成分(各占33%),以及来自20个交错表示的细菌菌株的基因组DNA混合物。通过使用三种不同的生物信息学方法(Kraken,MG-RAST和One Codex)分析MinION序列数据,评估了低复杂度群落的生物分类组成。结果:在原始MinION装置上运行,使用版本5试剂盒和化学方法从单一菌株制备的文库中生成的长读序列,在平均总体研究中产生的读物少至224到多达3497个双向高质量(2D)长度为6000 bp。对于单菌株分析,通过不同方法将读物分配给正确属的范围为53.1%至99.5%,将正确种属分配的范围为23.9%至99.5%,并且大多数错误分配的读物都是与紧密相关的生物。以相同设置测序的合成元基因组产生了约5500 bp的714个高质量2D读数,正确分配给物种水平的比例高达98%。使用版本6试剂盒和化学方法生成的合成元基因组MinION文库可产生899个至3497个2D读段,平均长度为5700 bp,在物种水平上的准确度高达98%。观察到的“相等”和“稀有”合成文库的社区比例接近已知比例,在所有测试中均从0.1%降低到10%。对于一个具有交错贡献的20种模拟社区,测序运行检测到除3种外的所有物种(每个包含99%的读段都分配给了正确的科。)结论:在当前水平的输出和序列质量(不到4× 10 3 二维读取合成的基因组),MinION测序后进行Kraken或One Codex分析有可能提供快速而准确的宏基因组分析,其中该财团由数量有限的分类单元组成。在这项研究中提到的内容包括:MinION平台对输入DNA的质量高度敏感,文库和流通池中测序结果的高度可变性以及每个分析极限的2D读数数量相对较少,这些对非常稀有成分的检测受到限制菌群,可能会限制MinION在测序高度复杂的宏基因组学社区(预计会有数千个分类单元)中进行测序的效用。当前可用的数据分析工具的局限性表明,使用长读取数据表征微生物群落的分析方法仍有很大的改进空间。尽管如此,由MinION生成的高质量读段的准确分类学分配接近99.5%的事实,并且在大多数情况下,推断的群落结构反映了合成混合物的已知比例,因此有必要进一步探索实际应用到环境宏基因组学中,因为平台不断发展和完善。随着序列通量的进一步提高和错误率的降低,该平台显示出对更复杂的微生物群落组成和结构进行精确实时分析的巨大希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号