Filtering Techniques for Regular Expression Matching in Strings

Abstract

Matching a regular expression (regex) against a text is widely used in applications such as text editing, information extraction, and intrusion detection systems (IDS). Traditional algorithms generally compile an equivalent automaton from the regex query, then run it over the text to find all matching results. However, their running time scales linearly with the size of the text. Recent algorithms use various filtering techniques to jump quickly to candidate positions in the text where a matching result may appear, so that only these candidate positions are verified by the automaton. In this paper, we give a full specification of filtering techniques for the regex matching problem, in which filters for the regex query are classified into positive factors and negative factors. We review three typical positive factors (prefix, suffix, and necessary factor) and show that negative factors can collaborate with positive factors to significantly improve the filtering ability.
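The filter-then-verify idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm: it covers only a prefix-style positive factor, and it assumes the caller supplies by hand a set of literal strings such that every match of the regex begins with one of them. The function name `prefix_filtered_search` and its parameters are hypothetical, and Python's `re` module stands in for the verifying automaton.

```python
import re


def prefix_filtered_search(pattern, prefixes, text):
    """Return (start, end) spans of regex matches found at candidate positions.

    Assumption of this sketch: `prefixes` is a set of literal strings such
    that every match of `pattern` starts with one of them. The caller is
    responsible for that property; it is not derived from the regex here.
    """
    compiled = re.compile(pattern)

    # Filtering step: collect every position where some prefix literal occurs.
    candidates = set()
    for literal in prefixes:
        pos = text.find(literal)
        while pos != -1:
            candidates.add(pos)
            pos = text.find(literal, pos + 1)

    # Verification step: run the regex only at the candidate positions.
    spans = []
    for pos in sorted(candidates):
        match = compiled.match(text, pos)
        if match:
            spans.append(match.span())
    return spans


if __name__ == "__main__":
    text = "xxabeefyycdfzzcdef ab"
    print(prefix_filtered_search(r"(ab|cd)e+f", {"ab", "cd"}, text))
```

Here the literal search reports four candidate positions, and only two of them survive verification. Negative factors, which the abstract says can further prune candidates, are not modeled in this sketch.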

Bibliographic details

Source: Database Systems for Advanced Applications, 2018, pp. 118-122 (5 pages)
Conference location: Gold Coast (AU)
Authors: Tao Qiu; Xiaochun Yang; Bin Wang
Author affiliation: School of Computer Science and Engineering, Northeastern University, Liaoning 110819, China
Original format: PDF
Language: English
Keywords: Regular expression; Filtering technique; Query efficiency
