首页> 美国卫生研究院文献>Nucleic Acids Research >Probabilistic error correction for RNA sequencing
【2h】

Probabilistic error correction for RNA sequencing

机译:RNA测序的概率误差校正

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Sequencing of RNAs (RNA-Seq) has revolutionized the field of transcriptomics, but the reads obtained often contain errors. Read error correction can have a large impact on our ability to accurately assemble transcripts. This is especially true for de novo transcriptome analysis, where a reference genome is not available. Current read error correction methods, developed for DNA sequence data, cannot handle the overlapping effects of non-uniform abundance, polymorphisms and alternative splicing. Here we present SEquencing Error CorrEction in Rna-seq data (SEECER), a hidden Markov Model (HMM)–based method, which is the first to successfully address these problems. SEECER efficiently learns hundreds of thousands of HMMs and uses these to correct sequencing errors. Using human RNA-Seq data, we show that SEECER greatly improves on previous methods in terms of quality of read alignment to the genome and assembly accuracy. To illustrate the usefulness of SEECER for de novo transcriptome studies, we generated new RNA-Seq data to study the development of the sea cucumber Parastichopus parvimensis. Our corrected assembled transcripts shed new light on two important stages in sea cucumber development. Comparison of the assembled transcripts to known transcripts in other species has also revealed novel transcripts that are unique to sea cucumber, some of which we have experimentally validated.>Supporting website: .
机译:RNA测序(RNA-Seq)彻底改变了转录组学领域,但获得的读段通常包含错误。读取错误校正会严重影响我们准确组装成绩单的能力。对于没有参考基因组的从头转录组分析,尤其如此。针对DNA序列数据开发的当前读取错误校正方法无法处理不均匀丰度,多态性和选择性剪接的重叠效应。在这里,我们介绍基于Rna-seq数据的序列错误校正(SEECER),这是一种基于隐马尔可夫模型(HMM)的方法,它是成功解决这些问题的第一个方法。 SEECER有效地学习了数十万个HMM,并使用它们来纠正定序错误。使用人RNA-Seq数据,我们显示SEECER在与基因组的阅读比对质量和组装准确性方面大大改善了以前的方法。为了说明SEECER在从头转录组研究中的有用性,我们生成了新的RNA-Seq数据来研究海参Parastichopus parvimensis的发育。我们更正的汇编成绩单为海参开发的两个重要阶段提供了新的思路。将组装的转录本与其他物种的已知转录本进行比较,还发现了海参特有的新颖转录本,其中一些已通过实验验证。>支持网站:

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号