A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms

Sing-Hoi Sze; Meaghan L. Pimsler; Jeffery K. Tomberlin; Corbin D. Jones; Aaron M. Tarone

Abstract

Background: With the increased availability of de novo assembly algorithms, it is feasible to study entire transcriptomes of non-model organisms. While algorithms are available that are specifically designed for performing transcriptome assembly from high-throughput sequencing data, they are very memory-intensive, limiting their applications to small data sets with few libraries.

Results: We develop a transcriptome assembly algorithm that recovers alternatively spliced isoforms and expression levels while utilizing as many RNA-Seq libraries as possible that contain hundreds of gigabases of data. New techniques are developed so that computations can be performed on a computing cluster with a moderate amount of physical memory.

Conclusions: Our strategy minimizes memory consumption while simultaneously obtaining comparable or improved accuracy over existing algorithms. It provides support for incremental updates of assemblies when new libraries become available.


Bibliographic details

Source: BMC Genomics, 2017, Issue 4
Authors: Sing-Hoi Sze; Meaghan L. Pimsler; Jeffery K. Tomberlin; Corbin D. Jones; Aaron M. Tarone
Original format: PDF
Chinese Library Classification: Medical genetics

