首页> 美国卫生研究院文献>Bioinformatics >SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics
【2h】

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics

机译:SPARSE:二次时间同时对齐和折叠RNA无需基于序列的启发式

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n6). Subsequently, numerous faster ‘Sankoff-style’ approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity ( ≥  quartic time).>Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm ‘sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)’, which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff’s original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics.>Availability and implementation: SPARSE is freely available at .>Contact: >Supplementary information: are available at Bioinformatics online.
机译:>动机:RNA-Seq实验揭示了许多新颖的ncRNA。用于基于同时对齐和折叠的分析的金标准遭受O(n 6 )的极端时间复杂性的困扰。随后,提出了许多更快的“桑科夫式”方法。通常,此类方法的性能依赖于基于序列的启发式方法,该方法将搜索空间限制为最佳或接近最佳的序列比对。但是,基于序列的方法的准确性会破坏序列同一性低于60%的RNA。诸如LocARNA之类的不需要基于序列的启发式方法的比对方法仅限于高复杂度(≥四次时间)。>结果:打破这一障碍,我们引入了新颖的Sankoff风格算法``简化的预测和基于其结构集合(SPARSE)的RNA序列比对,该序列以二次时间运行,没有基于序列的启发式算法。为了实现这种低复杂性,与序列比对算法相比,SPARSE具有基于RNA集成体的结构特性的强稀疏性。在PMcomp之后,SPARSE从轻量级能量计算中获得了进一步的加速。尽管所有现有的轻量级Sankoff样式方法都禁止循环删除和插入,从而限制了Sankoff的原始模型,但SPARSE首次将Sankoff算法完全转换为轻量级能量模型。与LocARNA相比,SPARSE可以在更少的时间内实现相似的比对和更好的折叠质量(加速:3.7)。在类似的运行时,它比使用基于序列的试探法的RAF对齐方式更准确地排列低序列同一性实例。>可用性和实现: SPARSE在以下位置免费提供。>联系方式: >补充信息:可在线访问生物信息学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号