首页> 外文会议>Pattern recognition in bioinformatics >A Maximum-Likelihood Formulationand EM Algorithm for the Protein Multiple Alignment Problem
【24h】

A Maximum-Likelihood Formulationand EM Algorithm for the Protein Multiple Alignment Problem

机译:蛋白质多重比对问题的最大似然公式 r n和EM算法

获取原文
获取原文并翻译 | 示例

摘要

A given group of protein sequences of different lengths is considered as resulting from random transformations of independent random ancestor sequences of the same preset smaller length, each produced in accordance with an unknown common probabilistic profile. We describe the process of transformation by a Hidden Markov Model (HMM) which is a direct generalization of the PAM model for amino acids. We formulate the problem of finding the maximum likelihood probabilistic ancestor profile and demonstrate its practicality. The proposed method of solving this problem allows for obtaining simultaneously the ancestor profile and the posterior distribution of its HMM, which permits efficient determination of the most probable multiple alignment of all the sequences. Results obtained on the BAliBASE 3.0 protein alignment benchmark indicate that the proposed method is generally more accurate than popular methods of multiple alignment such as CLUSTALW, DIALIGN and ProbAlign.
机译:给定一组不同长度的蛋白质序列是由于相同预设较小长度的独立随机祖先序列的随机转化而产生的,每个序列均根据未知的常见概率分布图产生。我们描述了通过隐马尔可夫模型(HMM)进行转化的过程,该模型是氨基酸PAM模型的直接概括。我们提出寻找最大似然概率祖先轮廓的问题,并证明其实用性。所提出的解决该问题的方法允许同时获得其HMM的祖先轮廓及其后部分布,这允许有效确定所有序列的最可能的多重比对。在BAliBASE 3.0蛋白质比对基准上获得的结果表明,该方法通常比流行的多重比对方法(如CLUSTALW,DIALIGN和ProbAlign)更准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号