【24h】

On Hardness of Jumbled Indexing

机译:论混杂索引的难度

获取原文

摘要

Jumbled indexing is the problem of indexing a text T for queries that ask whether there is a substring of T matching a pattern represented as a Parikh vector, i.e., the vector of frequency counts for each character. Jumbled indexing has garnered a lot of interest in the last four years; for a partial list see. There is a naive algorithm that preprocesses all answers in O(n~2|Σ|) time allowing quick queries afterwards, and there is another naive algorithm that requires no preprocessing but has O(n log |Σ|) query time. Despite a tremendous amount of effort there has been little improvement over these running times. In this paper we provide good reason for this. We show that, under a 3SUM-hardness assumption, jumbled indexing for alphabets of size ω(1) requires Ω(n~(2-∈)) preprocessing time or Ω(n~(1-δ)) query time for any ∈, δ > 0. In fact, under a stronger 3SUM-hardness assumption, for any constant alphabet size r > 3 there exist describable fixed constant ∈_r and δ_r such that jumbled indexing requires Ω(n~(2-∈_r)) preprocessing time or Ω(n~(1-δ-r)) query time.
机译:混乱的索引编制是为查询文本T编制索引的问题,这些查询询问是否存在T的子字符串与表示为Parikh向量(即每个字符的频率计数向量)的模式匹配。在过去的四年中,混乱的索引引起了很多关注。有关部分列表,请参见。有一种天真的算法可以在O(n〜2 |Σ|)时间内对所有答案进行预处理,从而可以在以后进行快速查询;还有另一种天真的算法不需要进行预处理,但查询时间为O(n log |Σ|)。尽管付出了巨大的努力,但在这些运行时间上并没有什么改善。在本文中,我们为此提供了充分的理由。我们显示,在3SUM硬度假设下,大小为ω(1)的字母的混杂索引要求Ω(n〜(2-∈))预处理时间或Ω(n〜(1-δ))任意查询时间,δ>0。实际上,在更强的3SUM硬度假设下,对于任何恒定的字母大小r> 3,都存在可描述的固定常数∈_r和δ_r,使得混杂索引需要Ω(n〜(2-∈_r))预处理时间或Ω(n〜(1-δ-r))查询时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号