Source: U.S. National Institutes of Health literature, Bioinformatics

Faster computation of exact RNA shape probabilities


Abstract

Motivation: Abstract shape analysis allows efficient computation of a representative sample of low-energy foldings of an RNA molecule. More comprehensive information is obtained by computing shape probabilities, which accumulate the Boltzmann probabilities of all structures within each abstract shape. Such information is superior to free energies because it is independent of sequence length and base composition. Until now, however, shape probabilities have been computed by evaluating all shapes simultaneously, at a computational cost that is exponential in the length of the sequence.

Results: We devise an approach called RapidShapes that computes the shapes above a specified probability threshold T by generating a list of promising shapes and constructing a specialized folding program for each shape to compute its share of Boltzmann probability. This aims at a heuristic improvement of runtime while still computing exact probability values.

Conclusion: Evaluating this approach and several substrategies, we find that only a small proportion of shapes actually have to be computed. For an RNA sequence of length 400, this leads, depending on the threshold, to a 10- to 138-fold speed-up compared with the previous complete method. Thus, probabilistic shape analysis has become feasible in medium-scale applications, such as the screening of RNA transcripts in a bacterial genome.

Availability: RapidShapes is available via

Contact:

Supplementary information: available at Bioinformatics online.
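To make the quantity concrete, the following is a minimal sketch of what a shape probability is: the sum of the Boltzmann probabilities of all structures that map to the same abstract shape, optionally filtered by a threshold T. It is not the RapidShapes algorithm, which avoids enumerating structures. The toy structure list, its energies, the function names, the `_` label for the unfolded structure, and the simplified most-abstract shape mapping are all assumptions made for illustration.

```python
import math
from collections import defaultdict

# Assumed thermal energy RT in kcal/mol at 37 °C (1.98717e-3 kcal/(mol K) * 310.15 K).
RT = 0.61632


def shape_of(dot_bracket):
    """Simplified, most abstract shape of a dot-bracket structure (illustrative only).

    Unpaired bases are ignored, and any base pair that encloses exactly one
    other pair (stacking, bulge, internal loop) is merged with it.
    """
    root = []
    stack = [root]
    for ch in dot_bracket:
        if ch == "(":
            node = []          # a base pair, holding the pairs it directly encloses
            stack[-1].append(node)
            stack.append(node)
        elif ch == ")":
            stack.pop()

    def render(children):
        parts = []
        for node in children:
            while len(node) == 1:   # skip through unbranched nesting
                node = node[0]
            parts.append("[" + render(node) + "]")
        return "".join(parts)

    return render(root) or "_"      # "_" is a hypothetical label for the open chain


def shape_probabilities(structures, threshold=0.0):
    """Accumulate Boltzmann weights per shape and keep shapes with p >= threshold."""
    weights = defaultdict(float)
    for dot_bracket, energy in structures:
        weights[shape_of(dot_bracket)] += math.exp(-energy / RT)
    z = sum(weights.values())       # partition function over the listed structures
    return {s: w / z for s, w in weights.items() if w / z >= threshold}


# Hypothetical structures and free energies (kcal/mol), not taken from the paper.
toy = [
    ("((((...))))....", -3.0),
    ("(((.....)))....", -2.5),
    ("((...))((...)).", -2.0),
    ("...............", 0.0),
]

if __name__ == "__main__":
    for shape, p in sorted(shape_probabilities(toy, threshold=0.1).items()):
        print(shape, round(p, 3))
```

Explicit enumeration like this grows exponentially with sequence length, which is the cost the abstract describes; the sketch only illustrates the definition of the value that a per-shape folding program would compute exactly.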
