Journal of Biomolecular Structure and Dynamics
An efficient binomial model-based measure for sequence comparison and its application.


Abstract

Sequence comparison is one of the major tasks in bioinformatics; it can provide evidence of structural and functional conservation as well as of evolutionary relationships. Several similarity/dissimilarity measures exist for sequence comparison, but challenges remain. This paper presents a binomial model-based measure for analyzing biological sequences. With the help of a random indicator, the occurrence of a word at any position of a sequence can be regarded as a Bernoulli random variable, and the sum of these occurrences (the word count) is well known to follow a binomial distribution. Using a recursive formula, we compute the binomial probability of the word count and propose a binomial model-based measure based on relative entropy. The proposed measure was tested in extensive experiments, including classification of HEV genotypes and phylogenetic analysis, and was further compared with alignment-based and alignment-free measures. The results demonstrate that the proposed binomial model-based measure is more efficient.
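The abstract does not give the formulas, so the following Python sketch is only one plausible reading of the pipeline it describes: count k-words, evaluate the binomial probability of each count with a recurrence, and compare two sequences through relative entropy. Several details are assumptions made for illustration and are not taken from the paper: word probabilities come from an i.i.d. base-frequency model, each profile is normalized to sum to 1, the relative entropy is symmetrized, and a small `eps` guards against zeros. The function names are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def word_counts(seq, k):
    """Count overlapping k-words in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def binomial_pmf(n, p, x):
    """P(X = x) for X ~ Binomial(n, p), using the recurrence
    P(0) = (1 - p)^n,  P(j) = P(j - 1) * (n - j + 1) / j * p / (1 - p),
    accumulated in log space to avoid underflow for long sequences."""
    if p <= 0.0:
        return 1.0 if x == 0 else 0.0
    if p >= 1.0:
        return 1.0 if x == n else 0.0
    log_prob = n * math.log1p(-p)                 # log P(X = 0)
    log_ratio = math.log(p) - math.log1p(-p)
    for j in range(1, x + 1):
        log_prob += math.log((n - j + 1) / j) + log_ratio
    return math.exp(log_prob)


def binomial_profile(seq, k, alphabet="ACGT", eps=1e-12):
    """Normalized vector of binomial probabilities of the observed word counts.
    Assumption: a word's per-position probability is the product of the
    base frequencies of its letters (i.i.d. model)."""
    n = len(seq) - k + 1                          # number of word positions
    base_freq = {a: seq.count(a) / len(seq) for a in alphabet}
    counts = word_counts(seq, k)
    probs = []
    for word in map("".join, product(alphabet, repeat=k)):
        p = math.prod(base_freq[c] for c in word)
        probs.append(binomial_pmf(n, p, counts[word]) + eps)
    total = sum(probs)
    return [v / total for v in probs]


def binomial_distance(seq_a, seq_b, k=2):
    """Symmetrized relative entropy between two binomial profiles
    (the symmetrization is an assumption of this sketch)."""
    pa = binomial_profile(seq_a, k)
    pb = binomial_profile(seq_b, k)
    kl_ab = sum(a * math.log(a / b) for a, b in zip(pa, pb))
    kl_ba = sum(b * math.log(b / a) for a, b in zip(pa, pb))
    return kl_ab + kl_ba
```

Calling `binomial_distance(seq_a, seq_b, k=2)` on two DNA strings returns a non-negative value that is zero for identical sequences. A matrix of such pairwise values could then be passed to a standard tree-building method for the kind of phylogenetic analysis the abstract mentions.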
