首页> 外文学位 >Bayesian parameter and order estimation in profile hidden Markov models.
【24h】

Bayesian parameter and order estimation in profile hidden Markov models.

机译:轮廓隐藏Markov模型中的贝叶斯参数和阶数估计。

获取原文
获取原文并翻译 | 示例

摘要

Hidden Markov models are used as tools for pattern recognition in a number of areas, ranging from speech processing to biological sequence analysis. Profile hidden Markov models represent a class of so-called "left-right" models that have an architecture that is specifically relevant to classification of proteins into structural families based on their amino acid sequences. Standard learning methods for such models employ a variety of heuristics applied to the expectation-maximization implementation of the maximum likelihood estimation procedure in order to find the global maximum of the likelihood function.; In this thesis, maximum likelihood estimation is compared to fully Bayesian estimation of parameters for profile hidden Markov models with a small number of parameters. Bayesian methods are found to assign higher scores to data sequences that are distantly related to the pattern consensus, show better performance in classifying these sequences correctly, and continue to perform robustly with regard to misspecification of the number of model parameters, relative to maximum likelihood methods. Though the study is limited in scope, the results are expected to remain relevant for models with a large number of parameters and other types of left-right hidden Markov models.; Further, higher order hidden Markov models are investigated and the standard method of discerning Markov order, the Bayesian Information Criterion (BIC). The results show that the overlap in emission distributions from different hidden states is a dominant factor governing the BICs ability to correctly discern model order.
机译:隐藏的马尔可夫模型在许多领域中用作模式识别的工具,范围从语音处理到生物序列分析。轮廓隐藏的马尔可夫模型代表一类所谓的“左右”模型,这些模型的结构与基于蛋白质的氨基酸序列将蛋白质分类为结构家族特别相关。用于此类模型的标准学习方法采用各种启发式方法,将其应用于最大似然估计过程的期望最大化实现中,以便找到似然函数的全局最大值。在本文中,将最大似然估计与具有少量参数的轮廓隐藏Markov模型的参数的完全贝叶斯估计进行比较。相对于最大似然方法,发现贝叶斯方法为与模式一致性远相关的数据序列分配更高的分数,在正确分类这些序列方面显示出更好的性能,并且在模型参数数量的错误指定方面继续表现出色。尽管研究范围有限,但预期结果仍将与具有大量参数的模型以及其他类型的左右隐马尔可夫模型相关。此外,还研究了高阶隐马尔可夫模型,并通过贝叶斯信息准则(BIC)来识别马尔可夫阶。结果表明,来自不同隐藏状态的排放分布的重叠是控制BIC正确识别模型顺序的能力的主要因素。

著录项

  • 作者

    Lewis, Steven James.;

  • 作者单位

    The Claremont Graduate University.;

  • 授予单位 The Claremont Graduate University.;
  • 学科 Mathematics.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 196 p.
  • 总页数 196
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 数学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号