首页> 外文会议>International Conference on Bioinformatics Models, Methods and Algorithms >On using Longer RNA-seq Reads to Improve Transcript Prediction Accuracy
【24h】

On using Longer RNA-seq Reads to Improve Transcript Prediction Accuracy

机译:使用较长的RNA-SEQ读取以提高转录物预测精度

获取原文

摘要

Over the past decade, sequencing read length has increased from tens to hundreds and then to thousands of bases. Current cDNA synthesis methods prevent RNA-seqreads from being long enough to entirely capture all the RNA transcripts, but long reads can still provide connectivity information on chains of multiple exons that are included in transcripts. We demonstrate that exploiting full connectivity information leads to significantly higher prediction accuracy, as measured by the F-score. For this purpose we implemented the solution to the Minimum Path Cover with Subpath Constraints problem introduced in (Rizzi et al., 2014), which is an extension of the classical Minimum Path Cover problem and was shown solvable by min-cost flows. We show that, under hypothetical conditions of perfect sequencing, our approach is able to use long reads more effectively than two state-of-the-art tools, StringTie and FlipFlop. Even in this setting the problem is not trivial, and errors in the underlying flow graph introduced by sequencing and alignment errors complicate the problem further. As such our work also demonstrates the need for a development of a good spliced read aligner for long reads. Our proof-of-concept implementation is available at http://www.cs.helsinki.fi/en/gsa/traphlor.
机译:在过去的十年中,测序读长度从数十到数百个增加到数千个基础。电流cDNA合成方法可防止RNA-SEQREADS足够长,以完全捕获所有RNA转录物,但是长读取仍然可以提供关于在转录物中包括的多个外显子链的连接信息。我们展示利用完全连接信息导致通过F分数测量的预测准确性显着更高。为此目的,我们将解决方案实施到最小路径覆盖范围内引入的子路径限制问题(Rizzi等,2014),这是经典最小路径覆盖问题的延伸,并被最小成本流动所解释。我们表明,在完美排序的假设条件下,我们的方法能够比两个最先进的工具,Stringtie和Flipflop更有效地使用长读。即使在这个设置中,问题也不是微不足道的,并且通过排序和对齐误差引入的底层流图中的错误进一步使问题复杂化。因此,我们的工作也表明需要开发一个很好的拼接读取对齐器,用于长读取。我们的概念验证实现可在http://www.cs.helsinki.fi/en/gsa/traphlor提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号