Levenshtein Distance, Sequence Comparison and Biological Database Search

Berger Bonnie; Waterman Michael S.; Yu Yun William




Abstract

Levenshtein edit distance has played a central role, both past and present, in sequence alignment in particular and biological database similarity search in general. We start our review with a history of dynamic programming algorithms for computing Levenshtein distance and sequence alignments. We then describe how those algorithms led to the heuristics employed in BLAST, the most widely used software in bioinformatics, a program that searches DNA and protein databases for evolutionarily relevant similarities. More recently, the advent of modern genomic sequencing and the volume of data it generates have resulted in a return to the problem of local alignment. We conclude with how the mathematical formulation of Levenshtein distance as a metric made possible additional optimizations to similarity search in biological contexts. These modern optimizations are built around the low metric entropy and fractional dimensionality of biological databases, enabling orders-of-magnitude acceleration of biological similarity search.
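As an illustration of the dynamic programming approach the abstract refers to, the following is a minimal sketch of the textbook unit-cost recurrence, not code taken from the article. The function name `levenshtein` and the two-row memory layout are assumptions made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Unit-cost edit distance (insert, delete, substitute) between a and b."""
    # Keep the shorter string on the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca from a
                curr[j - 1] + 1,     # insert cb into a
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]
```

Because this distance is a metric, it satisfies the triangle inequality, which is the property the abstract credits with enabling further search optimizations; for example, `levenshtein("kitten", "sitting")` evaluates to 3.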


Bibliographic details

Source
IEEE Transactions on Information Theory, 2021, no. 6, pp. 3287-3294 (8 pages)
Authors
Berger Bonnie; Waterman Michael S.; Yu Yun William
Author affiliations

MIT Dept Math & Elect Engn & Comp Sci 77 Massachusetts Ave Cambridge MA 02139 USA|MIT Dept Comp Sci 77 Massachusetts Ave Cambridge MA 02139 USA|MIT AI Lab Cambridge MA 02139 USA;

Univ Southern Calif Dept Biol Sci Quantitat & Computat Biol Sect Los Angeles CA 90089 USA;

Univ Toronto Dept Math Toronto ON M5S 2E4 Canada|Univ Toronto Scarborough Dept Comp & Math Sci Toronto ON M1C 1A4 Canada;

Original format: PDF
Language: English
Keywords
Heuristic algorithms; Biological information theory; Databases; Dynamic programming; Measurement; Bioinformatics; Levenshtein distance; sequence comparison; dynamic programming; similarity search; metric entropy;


