...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >PRACTICALITY AND TIME COMPLEXITY OF A SPARSIFIED RNA FOLDING ALGORITHM
【24h】

PRACTICALITY AND TIME COMPLEXITY OF A SPARSIFIED RNA FOLDING ALGORITHM

机译:简化的RNA折叠算法的实用性和时间复杂度

获取原文
获取原文并翻译 | 示例

摘要

Commonly used RNA folding programs compute the minimum free energy structure of a sequence under the pseudoknot exclusion constraint. They are based on Zuker's algorithm which runs in time O(n3). Recently, it has been claimed that RNA folding can be achieved in average time O(n2) using a sparsification technique. A proof of quadratic time complexity was based on the assumption that computational RNA folding obeys the "polymer-zeta property". Several variants of sparse RNA folding algorithms were later developed. Here, we present our own version, which is readily applicable to existing RNA folding programs, as it is extremely simple and does not require any new data structure. We applied it to the widely used Vienna RNAfold program, to create sibRNAfold, the first public sparsified version of a standard RNA folding program. To gain a better understanding of the time complexity of sparsified RNA folding in general, we carried out a thorough run time analysis with synthetic random sequences, both in the context of energy minimization and base pairing maximization. Contrary to previous claims, the asymptotic time complexity of a sparsified RNA folding algorithm using standard energy parameters remains O(n3) under a wide variety of conditions. Consistent with our run-time analysis, we found that RNA folding does not obey the "polymer-zeta property" as claimed previously. Yet, a basic version of a sparsified RNA folding algorithm provides 15- to 50-fold speed gain. Surprisingly, the same sparsification technique has a different effect when applied to base pairing optimization. There, its asymptotic running time complexity appears to be either quadratic or cubic depending on the base composition. The code used in this work is available at: http://sibRNAfold.sourceforge.net/.
机译:常用的RNA折叠程序可在假结排除约束条件下计算序列的最小自由能结构。它们基于运行时间为O(n3)的Zuker算法。最近,有人声称可以使用稀疏化技术在平均时间O(n2)中实现RNA折叠。二次时间复杂度的证明是基于这样的假设,即计算的RNA折叠遵循“聚合物-zeta特性”。后来开发了稀疏RNA折叠算法的几种变体。在这里,我们介绍我们自己的版本,因为它非常简单并且不需要任何新的数据结构,因此可以轻松地应用于现有的RNA折叠程序。我们将其应用于广泛使用的Vienna RNAfold程序,以创建sibRNAfold,这是标准RNA折叠程序的第一个公共稀疏版本。为了更好地理解稀疏RNA折叠的时间复杂性,我们在能量最小化和碱基配对最大化的背景下,对合成随机序列进行了全面的运行时分析。与之前的权利要求相反,使用标准能量参数的稀疏RNA折叠算法的渐近时间复杂度在各种条件下仍为O(n3)。与我们的运行时分析一致,我们发现RNA折叠不符合先前所声称的“聚合物-zeta性质”。但是,稀疏RNA折叠算法的基本版本可提供15到50倍的速度增益。令人惊讶的是,当应用于碱基配对优化时,相同的稀疏化技术具有不同的效果。在那里,根据基本组成,其渐近运行时间复杂度似乎是二次方还是三次方。该工作中使用的代码可在以下位置获得:http://sibRNAfold.sourceforge.net/。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号