Optimally Streaming Greedy Regular Expression Parsing

Abstract

We study the problem of streaming regular expression parsing: Given a regular expression and an input stream of symbols, how to output a serialized syntax tree representation as an output stream during input stream processing. We show that optimally streaming regular expression parsing, outputting bits of the output as early as is semantically possible for any regular expression of size m and any input string of length n, can be performed in time O(2^(m log m) + mn) on a unit-cost random-access machine. This is for the wide-spread greedy disambiguation strategy for choosing parse trees of grammatically ambiguous regular expressions. In particular, for a fixed regular expression, the algorithm's run-time scales linearly with the input string length. The exponential is due to the need for preprocessing the regular expression to analyze state coverage of its associated NFA, a PSPACE-hard problem, and tabulating all reachable ordered sets of NFA-states. Previous regular expression parsing algorithms operate in multiple phases, always requiring processing or storing the whole input string before outputting the first bit of output, not only for those regular expressions and input prefixes where reading to the end of the input is strictly necessary.
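To make the terms "greedy disambiguation" and "bits of the output" concrete, here is a minimal sketch in Python. It is not the algorithm of the paper: it is a plain backtracking parser that reads the whole input before returning anything and can take exponential time. The tuple encoding of regular expressions, the function names `parses` and `greedy_parse`, and the bit-coding convention (0 for the left alternative or one more star iteration, 1 for the right alternative or leaving the star) are assumptions made for this illustration only.

```python
from typing import Iterator, Optional, Tuple

# Hypothetical encoding used only in this sketch:
#   ("eps",)  ("chr", c)  ("seq", r1, r2)  ("alt", r1, r2)  ("star", r)
Regex = tuple


def parses(r: Regex, s: str, i: int) -> Iterator[Tuple[int, str]]:
    """Yield (end, bits) for each way r matches s[i:end], in greedy priority
    order: left alternative before right, iterate before leaving a star."""
    kind = r[0]
    if kind == "eps":
        yield i, ""
    elif kind == "chr":
        if i < len(s) and s[i] == r[1]:
            yield i + 1, ""
    elif kind == "seq":
        for j, left_bits in parses(r[1], s, i):
            for k, right_bits in parses(r[2], s, j):
                yield k, left_bits + right_bits
    elif kind == "alt":
        for j, bits in parses(r[1], s, i):
            yield j, "0" + bits  # assumed convention: 0 = left branch
        for j, bits in parses(r[2], s, i):
            yield j, "1" + bits  # assumed convention: 1 = right branch
    elif kind == "star":
        for j, body_bits in parses(r[1], s, i):
            if j > i:  # skip empty iterations so the recursion terminates
                for k, rest_bits in parses(r, s, j):
                    yield k, "0" + body_bits + rest_bits
        yield i, "1"  # assumed convention: 1 = stop iterating
    else:
        raise ValueError(f"unknown regex node: {kind!r}")


def greedy_parse(r: Regex, s: str) -> Optional[str]:
    """Return the bit-code of the first parse, in greedy order, that consumes
    all of s, or None if s does not match r."""
    for end, bits in parses(r, s, 0):
        if end == len(s):
            return bits
    return None


# (a|ab)(b|eps) is ambiguous on "ab"; the greedy parse takes "a" then "b".
A, B = ("chr", "a"), ("chr", "b")
AMBIGUOUS = ("seq", ("alt", A, ("seq", A, B)), ("alt", B, ("eps",)))
print(greedy_parse(AMBIGUOUS, "ab"))  # prints 00
```

The input "ab" also has the parse with code 11 (take "ab", then the empty string), but the left-first order reaches 00 first. Note that this sketch cannot emit the first bit until it has confirmed a full match, which is exactly the multi-phase behaviour the abstract contrasts with optimal streaming.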


Bibliographic details

Source: International Colloquium on Theoretical Aspects of Computing, 2014, 17 pages
Authors: Niels Bjorn Bugge Grathwohl; Fritz Henglein; Ulrik Terp Rasmussen
Original format: PDF
Chinese Library Classification: TP3-53

Related literature

1. Note on the greedy parsing optimality for dictionary-based text compression [J]. Maxime Crochemore, Alessio Langiu, Filippo Mignosi. Theoretical Computer Science, 2014.
2. A deterministic parsing algorithm for ambiguous regular expressions [J]. Angelo Borsotti, Luca Breveglieri, Stefano Crespi Reghizzi. Acta Informatica, 2021, no. 3.
3. DotStar: breaking the scalability and performance barriers in parsing regular expressions [J]. Davide Pasetto, Fabrizio Petrini, Virat Agarwal. Computer science, 2010, no. 1a2.
4. Optimally Streaming Greedy Regular Expression Parsing [C]. Niels Bjorn Bugge Grathwohl, Fritz Henglein, Ulrik Terp Rasmussen. International Colloquium on Theoretical Aspects of Computing, 2014.
5. Beyond regular: Pattern matching with extended regular expressions [D]. Benjamin Carle. 2010.
6. Parsing Neurodynamic Information Streams to Estimate the Frequency Magnitude and Duration of Team Uncertainty [O]. Ronald H. Stevens, Trysha L. Galloway. 2021.
7. Note on the greedy parsing optimality for dictionary-based text compression [O]. Maxime Crochemore, Alessio Langiu, Filippo Mignosi. 2014.
8. Generation of an Output Regular Expression of a Sequential Machine with a Specified Input Regular Expression [R]. S. S. Yau. 1966.
