首页> 外文期刊>Bioinformatics >Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures
【24h】

Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures

机译:配对随机树邻接语法以对齐和预测假结RNA结构

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Since the whole genome sequences of many species have been determined, computational prediction of RNA secondary structures and computational identification of those non-coding RNA regions by comparative genomics become important. Therefore, more advanced alignment methods are required. Recently, an approach of structural alignment for RNA sequences has been introduced to solve these problems. Pair hidden Markov models on tree structures (PHMMTSs) proposed by Sakakibara are efficient automata-theoretic models for structural alignment of RNA secondary structures, although PHMMTSs are incapable of handling pseudoknots. On the other hand, tree adjoining grammars (TAGs), a subclass of context-sensitive grammars, are suitable for modeling pseudoknots. Our goal is to extend PHMMTSs by incorporating TAGs to be able to handle pseudoknots.Results: We propose pair stochastic TAGs (PSTAGs) for aligning and predicting RNA secondary structures including a simple type of pseudoknot which can represent most known pseudoknot structures. First, we extend PHMMTSs defined on alignment of 'trees' to PSTAGs defined on alignment of 'TAG trees' which represent derivation processes of TAGs and are functionally equivalent to derived trees of TAGs. Then, we develop an efficient dynamic programming algorithm of PSTAGs for obtaining an optimal structural alignment including pseudoknots. We implement the PSTAG algorithm and demonstrate the properties of the algorithm by using it to align and predict several small pseudoknot structures. We believe that our implemented program based on PSTAGs is the first grammar-based and practically executable software for comparative analyses of RNA pseudoknot structures, and, further, non-coding RNAs.
机译:动机:由于已经确定了许多物种的全基因组序列,因此通过比较基因组学对RNA二级结构的计算预测以及对那些非编码RNA区域的计算鉴定变得非常重要。因此,需要更高级的对准方法。最近,已经引入了RNA序列的结构比对方法来解决这些问题。 Sakakibara提出的树结构上的配对隐马尔可夫模型(PHMMTS)是有效的自动机理论模型,用于RNA二级结构的结构比对,尽管PHMMTS无法处理假结。另一方面,树关联语法(TAG)是上下文相关语法的子类,适合于模拟假结。我们的目标是通过合并TAG来扩展PHMMTS,从而能够处理假结。结果:我们提出了配对随机TAG(PSTAG),用于对齐和预测RNA二级结构,包括可以代表大多数已知假结结构的简单类型的假结。首先,我们将“树”的对齐方式定义的PHMMTS扩展到“ TAG树”的对齐方式定义的PSTAG,PSTAG表示TAG的派生过程,并在功能上等同于TAG的派生树。然后,我们开发了一种有效的PSTAG动态规划算法,以获得包括假结在内的最佳结构排列。我们实现了PSTAG算法,并通过使用它来对齐和预测几个小的伪结结构来演示算法的性能。我们相信,我们基于PSTAGs实施的程序是第一个基于语法且可实际执行的软件,可用于RNA假结结构以及非编码RNA的比较分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号