Scoring Matrices That Induce Metrics on Sequences

Abstract

Scoring matrices are widely used in sequence comparisons. A scoring matrix γ is indexed by the symbols of an alphabet. The entry of γ in row a and column b measures the cost of the edit operation that replaces symbol a by symbol b. For a given scoring matrix and sequences s and t, we consider two kinds of induced scoring functions. The first function, known as weighted edit distance, is defined as the sum of the costs of the edit operations required to transform s into t. The second, known as normalized edit distance, is defined as the minimum quotient between the sum of the costs of the edit operations that transform s into t and the number of those edit operations. In this work we characterize the class of scoring matrices for which the induced weighted edit distance is actually a metric. We do the same for the normalized edit distance.
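
The two induced scoring functions can be illustrated with a minimal Python sketch. It is not the authors' method and rests on several assumptions that the abstract does not state: insertions and deletions are modeled as replacements involving a hypothetical gap symbol `GAP`, the scoring matrix is stored as a dictionary keyed by symbol pairs, the minimum is taken over alignments (each position edited at most once), and every alignment column, including a match a→a, counts as one edit operation. The helper `unit_cost` is likewise hypothetical and only builds a toy matrix.

```python
from math import inf

GAP = "-"  # hypothetical gap symbol standing in for insertions and deletions


def unit_cost(alphabet):
    """Toy scoring matrix (an assumption): 0 on the diagonal, 1 elsewhere."""
    symbols = list(alphabet) + [GAP]
    return {(a, b): 0.0 if a == b else 1.0 for a in symbols for b in symbols}


def weighted_edit_distance(s, t, cost):
    """Minimum total cost over alignments of s and t (classic dynamic program)."""
    m, n = len(s), len(t)
    d = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + cost[(s[i - 1], GAP)]   # delete s[i-1]
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + cost[(GAP, t[j - 1])]   # insert t[j-1]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            d[i][j] = min(
                d[i - 1][j - 1] + cost[(s[i - 1], t[j - 1])],  # replace
                d[i - 1][j] + cost[(s[i - 1], GAP)],           # delete
                d[i][j - 1] + cost[(GAP, t[j - 1])],           # insert
            )
    return d[m][n]


def normalized_edit_distance(s, t, cost):
    """Minimum of (total cost / number of operations) over alignments.

    best[i][j][k] holds the cheapest way to align s[:i] with t[:j]
    using exactly k alignment columns (operations).
    """
    m, n = len(s), len(t)
    if m == 0 and n == 0:
        return 0.0
    longest = m + n
    best = [[[inf] * (longest + 1) for _ in range(n + 1)] for _ in range(m + 1)]
    best[0][0][0] = 0.0
    for i in range(m + 1):
        for j in range(n + 1):
            for k in range(1, longest + 1):
                c = inf
                if i and j:
                    c = min(c, best[i - 1][j - 1][k - 1] + cost[(s[i - 1], t[j - 1])])
                if i:
                    c = min(c, best[i - 1][j][k - 1] + cost[(s[i - 1], GAP)])
                if j:
                    c = min(c, best[i][j - 1][k - 1] + cost[(GAP, t[j - 1])])
                best[i][j][k] = c
    return min(best[m][n][k] / k for k in range(1, longest + 1) if best[m][n][k] < inf)


if __name__ == "__main__":
    gamma = unit_cost("ab")
    print(weighted_edit_distance("ab", "ba", gamma))
    print(normalized_edit_distance("ab", "ba", gamma))
```

Under the toy unit-cost matrix, the normalized value for "ab" and "ba" is smaller than the weighted distance divided by the length of the shortest alignment, because a longer alignment that keeps one match spreads the same total cost over more operations. This is why the normalized distance has to track the number of operations explicitly instead of reusing the weighted-distance table.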