首页> 外文会议>Information Theory and Applications Workshop >Parametrized Stochastic Grammars for RNA Secondary Structure Prediction
【24h】

Parametrized Stochastic Grammars for RNA Secondary Structure Prediction

机译:RNA二级结构预测的参数化随机语法

获取原文

摘要

We propose a two-level stochastic context-free grammar (SCFG) architecture for parametrized stochastic modeling of a family of RNA sequences, including their secondary structure. A stochastic model of this type can be used for maximum a posteriori estimation of the secondary structure of any new sequence in the family. The proposed SCFG architecture models RNA subsequences comprising paired bases as stochastically weighted Dyck-language words, i.e., as weighted balanced-parenthesis expressions. The length of each run of unpaired bases, forming a loop or a bulge, is taken to have a phase-type distribution: that of the hitting time in a finite-state Markov chain. Without loss of generality, each such Markov chain can be taken to have a bounded complexity. The scheme yields an overall family SCFG with a manageable number of parameters.
机译:我们提出了一种用于分组的两级随机上下文语法(SCFG)架构,用于对RNA序列系列的参数化随机建模,包括其二级结构。这种类型的随机模型可用于最大限度地估计家庭中任何新序列的二级结构的后验估计。所提出的SCFG架构模型包括配对碱基作为随机加权的DYCK语言单词,即作为加权平衡括号表达式的RNA子件。每次运行形成环或凸起的每次运行的长度都具有相位类型的分布:有限状态马尔可夫链中的击球时间的长度。不损失一般性,每个这样的马尔可夫链可以被采用有界复杂性。该方案产生具有可管理数量的参数的整体族SCG。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号