首页> 外文会议>International symposium on string processing and information retrieval >Order-Preserving Incomplete Suffix Trees and Order-Preserving Indexes
【24h】

Order-Preserving Incomplete Suffix Trees and Order-Preserving Indexes

机译:保留顺序的不完整后缀树和保留顺序的索引

获取原文

摘要

Recently Kubica et al. (Inf. Process. Let., 2013) and Kim et al. (submitted to Theor. Comp. Sci.) introduced order-preserving pattern matching: for a given text the goal is to find its factors having the same 'shape' as a given pattern. Known results include a linear-time algorithm for this problem (in case of polynomially-bounded alphabet) and a generalization to multiple patterns. We give an O(n log log n) time construction of an index that enables order-preserving pattern matching queries in time proportional to pattern length. The main component is a data structure being an incomplete suffix tree in the order-preserving setting. The tree can miss single letters related to branching at internal nodes. Such incompleteness results from the weakness of our so called weak character oracle. However, due to its weakness, such oracle can answer queries on-line in O(log log n) time using a sliding-window approach. For most of the applications such incomplete suffix-trees provide the same functional power as the complete ones. We also give an O(n log n/log log n) time algorithm constructing complete order-preserving suffix trees.
机译:最近Kubica等。 (Inf。Process。Let。,2013)和Kim等。 (提交给Theor。Comp。Sci。)引入了保留顺序的模式匹配:对于给定的文本,目标是找到与给定模式具有相同“形状”的因素。已知的结果包括针对该问题的线性时间算法(在以多项式为界的字母的情况下)和对多种模式的推广。我们给出了索引的O(n log log n)时间构造,该构造使按顺序保留与模式长度成比例的模式匹配查询成为可能。主要组成部分是一个数据结构,该数据结构是保留顺序设置中不完整的后缀树。该树可能会丢失与内部节点上的分支相关的单个字母。这种不完整是由于我们所谓的弱字符甲骨文的弱点造成的。但是,由于其弱点,这样的oracle可以使用滑动窗口方法在O(log log n)时间内在线回答查询。对于大多数应用程序,此类不完整的后缀树提供与完整的后缀树相同的功能。我们还给出了O(n log n / log log n)时间算法,该算法构造了完整的保留顺序的后缀树。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号