...
首页> 外文期刊>BMC Genomics >A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
【24h】

A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms

机译:用于非模型生物的从头转录组组装的可扩展且高效存储的算法

获取原文
           

摘要

Background With increased availability of de novo assembly algorithms, it is feasible to study entire transcriptomes of non-model organisms. While algorithms are available that are specifically designed for performing transcriptome assembly from high-throughput sequencing data, they are very memory-intensive, limiting their applications to small data sets with few libraries. Results We develop a transcriptome assembly algorithm that recovers alternatively spliced isoforms and expression levels while utilizing as many RNA-Seq libraries as possible that contain hundreds of gigabases of data. New techniques are developed so that computations can be performed on a computing cluster with moderate amount of physical memory. Conclusions Our strategy minimizes memory consumption while simultaneously obtaining comparable or improved accuracy over existing algorithms. It provides support for incremental updates of assemblies when new libraries become available.
机译:背景技术随着从头组装算法的增加,研究非模型生物的整个转录组是可行的。虽然有专门为从高通量测序数据执行转录组组装而设计的可用算法,但它们占用大量内存,因此将其应用程序限制在具有少量库的小型数据集上。结果我们开发了转录组组装算法,该算法可恢复交替剪接的同工型和表达水平,同时利用尽可能多的RNA-Seq文库,其中包含数百个千兆位数据。开发了新技术,以便可以在具有中等物理内存量的计算群集上执行计算。结论我们的策略可以最大程度地减少内存消耗,同时获得与现有算法相当或更高的精度。当新库可用时,它为程序集的增量更新提供支持。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号