Journal: GigaScience

De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers

Abstract

Background: In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various ways. In particular, for non-model organisms and in the absence of an appropriate reference genome, RNA-Seq is used to reconstruct the transcriptome de novo. Although de novo transcriptome assembly of non-model organisms has been on the rise recently and new tools are frequently being developed, there is still a knowledge gap about which assembly software should be used to build a comprehensive de novo assembly.

Results: Here, we present a large-scale comparative study in which 10 de novo assembly tools are applied to 9 RNA-Seq data sets spanning different kingdoms of life. Overall, we built more than 200 single assemblies and evaluated their performance on a combination of 20 biological-based and reference-free metrics. Our study is accompanied by a comprehensive and extensible Electronic Supplement that summarizes all data sets, assembly execution instructions, and evaluation results. Trinity, SPAdes, and Trans-ABySS, followed by Bridger and SOAPdenovo-Trans, generally outperformed the other tools compared. Moreover, we observed species-specific differences in the performance of each assembler. No tool delivered the best results for all data sets.

Conclusions: We recommend a careful choice and normalization of evaluation metrics to select the best assembly results as a critical step in the reconstruction of a comprehensive de novo transcriptome assembly.
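The recommendation to normalize evaluation metrics before selecting an assembly can be illustrated with a minimal sketch. It assumes min-max scaling of each metric to the range [0, 1] followed by an unweighted sum, which is one common scheme and not necessarily the one used in the study. The function name `normalize_and_rank`, the assembler labels, the metric names, and all numeric values are hypothetical placeholders, not results from the paper.

```python
from typing import Dict, List, Set, Tuple


def normalize_and_rank(
    scores: Dict[str, Dict[str, float]],
    lower_is_better: Set[str],
) -> List[Tuple[str, float]]:
    """Min-max normalize each metric across assemblies and rank by the summed score.

    scores: assembly name -> {metric name -> raw value}; every assembly must
            report the same set of metrics.
    lower_is_better: metrics where a smaller raw value is the better one.
    """
    metrics = sorted(next(iter(scores.values())).keys())
    totals = {name: 0.0 for name in scores}

    for metric in metrics:
        values = [scores[name][metric] for name in scores]
        lo, hi = min(values), max(values)
        span = hi - lo
        for name in scores:
            if span == 0:
                # All assemblies tie on this metric, so it cannot separate them.
                norm = 0.0
            else:
                norm = (scores[name][metric] - lo) / span
                if metric in lower_is_better:
                    norm = 1.0 - norm  # flip so that 1.0 is always the best value
            totals[name] += norm

    # Highest summed score first; ties are broken alphabetically by name.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Hypothetical placeholder values, for illustration only.
    example = {
        "assembler_A": {"n50": 1000, "chimeras": 10},
        "assembler_B": {"n50": 2000, "chimeras": 30},
        "assembler_C": {"n50": 1500, "chimeras": 10},
    }
    for name, total in normalize_and_rank(example, lower_is_better={"chimeras"}):
        print(f"{name}\t{total:.2f}")
```

Scaling each metric separately keeps a metric with a large numeric range (such as contig length) from dominating one with a small range (such as a fraction of mapped reads). Because the abstract reports that no tool was best on all data sets, a ranking like this would be computed per data set.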