首页> 外文会议>International Symposium on String Processing and Information Retrieval >Order-Preserving Incomplete Suffix Trees and Order-Preserving Indexes
【24h】

Order-Preserving Incomplete Suffix Trees and Order-Preserving Indexes

机译:订单保留不完整的后缀树和秩序保留索引

获取原文

摘要

Recently Kubica et al. (Inf. Process. Let., 2013) and Kim et al. (submitted to Theor. Comp. Sci.) introduced order-preserving pattern matching: for a given text the goal is to find its factors having the same 'shape' as a given pattern. Known results include a linear-time algorithm for this problem (in case of polynomially-bounded alphabet) and a generalization to multiple patterns. We give an O(n log log n) time construction of an index that enables order-preserving pattern matching queries in time proportional to pattern length. The main component is a data structure being an incomplete suffix tree in the order-preserving setting. The tree can miss single letters related to branching at internal nodes. Such incompleteness results from the weakness of our so called weak character oracle. However, due to its weakness, such oracle can answer queries on-line in O(log log n) time using a sliding-window approach. For most of the applications such incomplete suffix-trees provide the same functional power as the complete ones. We also give an O(n log n/log log n) time algorithm constructing complete order-preserving suffix trees.
机译:最近Kubica等人。 (inf。过程。让。,2013)和Kim等人。 (提交给您。COMP。SCI。)推出了订单保留模式匹配:对于给定文本,目标是找到具有相同“形状”的因素作为给定模式。已知结果包括用于该问题的线性时间算法(在多项式有界字母表的情况下)和多个模式的概括。我们提供O(n log log n)时间构造的索引,其使得能够与模式长度成比例地匹配查询的订单保留模式。主组件是数据结构是在定期保留设置中的不完整后缀树。树可能会错过与内部节点的分支相关的单个字母。这种不完整性从我们所谓的弱角色甲骨文的弱点产生。但是,由于其弱点,此类Oracle可以使用滑动窗口方法在O(日志日志N)时间内在线回答查询。对于大多数应用程序,这种不完整的后缀树提供了与完整的功能相同的功能功率。我们还提供O(n log n / log log n)时间算法构建完整的秩序保留后缀树。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号