首页> 美国卫生研究院文献>Biology >Algorithms for Hidden Markov Models Restricted to Occurrences of Regular Expressions
【2h】

Algorithms for Hidden Markov Models Restricted to Occurrences of Regular Expressions

机译:受限于正则表达式出现的隐马尔可夫模型算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Hidden Markov Models (HMMs) are widely used probabilistic models, particularly for annotating sequential data with an underlying hidden structure. Patterns in the annotation are often more relevant to study than the hidden structure itself. A typical HMM analysis consists of annotating the observed data using a decoding algorithm and analyzing the annotation to study patterns of interest. For example, given an HMM modeling genes in DNA sequences, the focus is on occurrences of genes in the annotation. In this paper, we define a pattern through a regular expression and present a restriction of three classical algorithms to take the number of occurrences of the pattern in the hidden sequence into account. We present a new algorithm to compute the distribution of the number of pattern occurrences, and we extend the two most widely used existing decoding algorithms to employ information from this distribution. We show experimentally that the expectation of the distribution of the number of pattern occurrences gives a highly accurate estimate, while the typical procedure can be biased in the sense that the identified number of pattern occurrences does not correspond to the true number. We furthermore show that using this distribution in the decoding algorithms improves the predictive power of the model.
机译:隐马尔可夫模型(HMM)是广泛使用的概率模型,尤其是用于使用基础隐藏结构注释顺序数据。注释中的图案通常比隐藏结构本身与研究更相关。典型的HMM分析包括使用解码算法对观察到的数据进行注释并分析注释以研究感兴趣的模式。例如,给定HMM对DNA序列中的基因进行建模,重点是注释中基因的出现。在本文中,我们通过正则表达式定义模式,并提出了三种经典算法的限制,以考虑隐藏序列中模式出现的次数。我们提出了一种新的算法来计算模式出现次数的分布,并且我们扩展了两种最广泛使用的现有解码算法,以利用该分布中的信息。我们通过实验表明,对模式出现次数分布的期望给出了高度准确的估计,而在识别出的模式出现次数与真实次数不符的意义上,典型过程可能会产生偏差。我们进一步表明,在解码算法中使用这种分布可以提高模型的预测能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号