Approximate Regular Expression Matching with Multi-strings

Abstract

In this paper, we are interested in solving the approximate regular expression matching problem: we are given a regular expression R in advance and we wish to answer the following query: given a text T and a parameter k, find all the substrings of T which match the regular expression R with at most k errors (an error consists of deleting, inserting, or substituting a character). There exists a well-known solution for this problem in time O(mn), where m is the size of the regular expression (the number of operators and characters appearing in R) and n is the length of the text. There also exists a solution for the case k = 0 (exact regular expression matching) which solves the problem in time O(dn), where d is the number of strings in the regular expression (a string is a sequence of characters connected with the concatenation operator). In this paper, we show that both methods can be combined to solve the approximate regular expression matching problem in time O(kdn) for arbitrary k. This bound can be much better than the bound O(mn / log_{k+2} n) achieved by the best current regular expression matching algorithm in the case d < m / (k log_{k+2} n), that is, when k is not too large and R contains far fewer occurrences of ∪ and * than occurrences of concatenation (·).
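To make the problem statement concrete, the following is a minimal sketch of the simplest special case, in which R is a single string (so d = 1). It uses the classical Sellers dynamic program, which runs in O(mn) time; it is not the O(kdn) algorithm of the paper, and the function name `approx_match_ends` is a hypothetical name chosen for illustration. It reports the end positions of substrings of T that match the pattern with at most k errors.

```python
from typing import List


def approx_match_ends(pattern: str, text: str, k: int) -> List[int]:
    """Return 1-based end positions j such that some substring of `text`
    ending at position j matches `pattern` with at most k errors
    (insertions, deletions, or substitutions of single characters).

    Illustrative assumption: the "regular expression" is one plain string.
    """
    m = len(pattern)
    # col[i] = minimum number of errors needed to align pattern[:i] with
    # some substring of the text ending at the current text position.
    col = list(range(m + 1))
    ends = []
    for j, ch in enumerate(text, start=1):
        # col[0] stays 0 because a match may start at any text position.
        prev_diag = col[0]
        for i in range(1, m + 1):
            old = col[i]
            col[i] = min(
                prev_diag + (pattern[i - 1] != ch),  # match or substitution
                old + 1,                             # extra character in the text
                col[i - 1] + 1,                      # pattern character missing
            )
            prev_diag = old
        if col[m] <= k:
            ends.append(j)
    return ends


if __name__ == "__main__":
    print(approx_match_ends("abc", "xxabcxx", 0))
    print(approx_match_ends("abc", "xxabdxx", 1))
```

With k = 0 this degenerates to exact substring search. Handling a general regular expression requires propagating the same error counts through the automaton for R, which is where the bounds quoted in the abstract come from.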

Bibliographic record

Source
String Processing and Information Retrieval | 2011 | pp. 55-66 | 12 pages
Conference location: Pisa (IT)
Authors
Djamal Belazzougui; Mathieu Raffinot
Author affiliations

LIAFA, Univ. Paris Diderot - Paris 7, 75205 Paris Cedex 13, France

LIAFA, Univ. Paris Diderot - Paris 7, 75205 Paris Cedex 13, France

Original format: PDF
Language: English
Chinese Library Classification: Information processing

Similar literature

1. Improved approximate string matching and regular expression matching on Ziv-Lempel compressed texts [J]. Bille P., Fagerberg R., Gørtz I.L. ACM Transactions on Algorithms. 2010, No. 1
2. A New Translation from Semi-Extended Regular Expressions into NFAs and Its Application to Approximate Matching Problem [J]. Hiroaki Yamamoto. 電子情報通信学会技術研究報告. コンピュテーション. Theoretical Foundations of Computing. 2003, No. 326
3. Approximate Regular Expression Matching with Multi-strings [C]. Djamal Belazzougui, Mathieu Raffinot. International Symposium on String Processing and Information Retrieval. 2011
4. Beyond regular: Pattern matching with extended regular expressions [D]. Carle, Benjamin. 2010
5. Exploring efficient grouping algorithms in regular expression matching [O]. Chengcheng Xu, Jinshu Su, Shuhui Chen. 2012
6. Approximate regular expression matching with multi-strings [O]. Belazzougui Djamal, Raffinot Mathieu. 2013
7. Learning SAS's Perl Regular Expression Matching the Easy Way: By Doing [R]. Genovesi, P. 2015
