首页> 外文期刊>GigaScience >Binning unassembled short reads based on k-mer abundance covariance using sparse coding
【24h】

Binning unassembled short reads based on k-mer abundance covariance using sparse coding

机译:基于K-MER丰富协方差使用稀疏编码,分融合未使用的简短读数

获取原文
       

摘要

Background: Sequence-binning techniques enable the recovery of an increasing number of genomes from complex microbial metagenomes and typically require prior metagenome assembly, incurring the computational cost and drawbacks of the latter, e.g., biases against low-abundance genomes and inability to conveniently assemble multi-terabyte datasets. Results: We present here a scalable pre-assembly binning scheme (i.e., operating on unassembled short reads) enabling latent genome recovery by leveraging sparse dictionary learning and elastic-net regularization, and its use to recover hundreds of metagenome-assembled genomes, including very low-abundance genomes, from a joint analysis of microbiomes from the LifeLines DEEP population cohort (n = 1,135, 1010 reads). Conclusion: We showed that sparse coding techniques can be leveraged to carry out read-level binning at large scale and that, despite lower genome reconstruction yields compared to assembly-based approaches, bin-first strategies can complement the more widely used assembly-first protocols by targeting distinct genome segregation profiles. Read enrichment levels across 6 orders of magnitude in relative abundance were observed, indicating that the method has the power to recover genomes consistently segregating at low levels.
机译:背景:序列排序技术能够恢复来自复杂的微生物偏见组的越来越多的基因组,并且通常需要先前的偏射组件,这产生了后者的计算成本和缺点,例如,对低丰度基因组的偏差,无法方便地组装多个-terabyte数据集。结果:我们在此提供可扩展的预组装搭档方案(即,在未组装的短读数上运行)通过利用稀疏的字典学习和弹性净正规化来实现潜在的基因组恢复,并用来恢复数百种组装组装的基因组,包括非常从Lifelines深度群体队列(n = 1,135,> 1010读数)的微生物分析的低丰度基因组。结论:我们展示了稀疏的编码技术可以利用大规模进行读取级别镀纳,并且尽管与基于组装的方法相比,基因组重建产量较低,但垃圾箱可以补充更广泛使用的组装第一协议通过靶向不同的基因组隔离型材。观察到在相对丰度中的6个级的读取浓缩水平,表明该方法具有在低水平下始终隔离基因组的功率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号