On Hardness of Jumbled Indexing



Abstract

Jumbled indexing is the problem of indexing a text T for queries that ask whether there is a substring of T matching a pattern represented as a Parikh vector, i.e., the vector of frequency counts for each character. Jumbled indexing has garnered a lot of interest in the last four years; for a partial list see. There is a naive algorithm that preprocesses all answers in $O(n^2 |\Sigma|)$ time, allowing quick queries afterwards, and there is another naive algorithm that requires no preprocessing but has $O(n \log |\Sigma|)$ query time. Despite a tremendous amount of effort there has been little improvement over these running times. In this paper we provide good reason for this. We show that, under a 3SUM-hardness assumption, jumbled indexing for alphabets of size $\omega(1)$ requires $\Omega(n^{2-\epsilon})$ preprocessing time or $\Omega(n^{1-\delta})$ query time for any $\epsilon, \delta > 0$. In fact, under a stronger 3SUM-hardness assumption, for any constant alphabet size $r \geq 3$ there exist describable fixed constants $\epsilon_r$ and $\delta_r$ such that jumbled indexing requires $\Omega(n^{2-\epsilon_r})$ preprocessing time or $\Omega(n^{1-\delta_r})$ query time.
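The two naive baselines mentioned in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not code from the paper: the function names `jumbled_match` and `preprocess_all` are hypothetical, and the query version uses a hash-based counter, so it runs in expected O(n) time per query rather than reproducing the exact $O(n \log |\Sigma|)$ bound (which would correspond to, for example, a comparison-based dictionary over the alphabet).

```python
from collections import Counter


def jumbled_match(text, pattern_counts):
    """No-preprocessing query: does some substring of `text` have exactly
    the character counts in `pattern_counts` (a Parikh vector as a dict)?"""
    target = {c: k for c, k in pattern_counts.items() if k > 0}
    m = sum(target.values())          # every match has length m
    if m == 0:
        return True                   # the empty substring matches the zero vector
    if m > len(text):
        return False

    diff = Counter()                  # diff[c] = (count in window) - (count in target)
    for c, k in target.items():
        diff[c] -= k
    nonzero = len(target)             # number of characters with diff[c] != 0

    def bump(c, delta):
        nonlocal nonzero
        before = diff[c]
        after = before + delta
        diff[c] = after
        if before == 0:
            nonzero += 1
        if after == 0:
            nonzero -= 1

    # Slide a window of length m across the text.
    for i, c in enumerate(text):
        bump(c, +1)
        if i >= m:
            bump(text[i - m], -1)
        if i >= m - 1 and nonzero == 0:
            return True
    return False


def preprocess_all(text, alphabet):
    """Quadratic preprocessing: collect the Parikh vector of every substring.
    Takes O(n^2 |alphabet|) time; a query is then a set lookup."""
    seen = set()
    for i in range(len(text)):
        counts = dict.fromkeys(alphabet, 0)
        for j in range(i, len(text)):
            counts[text[j]] += 1
            seen.add(tuple(counts[c] for c in alphabet))
    return seen


print(jumbled_match("abcab", {"a": 1, "b": 1, "c": 1}))  # True  ("abc")
print(jumbled_match("abcab", {"a": 2}))                  # False (no "aa")
index = preprocess_all("abcab", "abc")
print((2, 2, 1) in index)                                # True  (the whole text)
```

The paper's result says that, under the stated 3SUM-hardness assumptions, no index can improve on both of these baselines by a polynomial factor simultaneously.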


Bibliographic record

Source
International Colloquium on Automata, Languages and Programming | 2014 | pp. 114-125 | 12 pages
Authors
Amihood Amir; Timothy M. Chan; Moshe Lewenstein; Noa Lewenstein
Original format: PDF

Similar literature


1. Algorithms for Jumbled Indexing, Jumbled Border and Jumbled Square on run-length encoded strings [J]. Amihood Amir, Alberto Apostolico, Tirza Hirst. Theoretical Computer Science. 2016, issue Pta2
2. Fast and Simple Jumbled Indexing for Binary Run-Length Encoded Strings [J]. Luís Cunha, Simone Dantas, Travis Gagie. LIPIcs: Leibniz International Proceedings in Informatics. 2017, no. 30
3. On hardness of several string indexing problems [J]. Kasper Green Larsen, J. Ian Munro, Jesper Sindahl Nielsen. Theoretical Computer Science. 2015
4. On Hardness of Jumbled Indexing [C]. Amihood Amir, Timothy M. Chan, Moshe Lewenstein. International Colloquium on Automata, Languages, and Programming (ICALP 2014). 2014
5. A geotechnical investigation into the Chaos Jumbles rockslide avalanches, Lassen Volcanic National Park, California [D]. Ninivaggi, Seth A. 2005
6. Controlled Vocabularies Indexing and Medical Language Processing. Expert Indexing Systems: Research on Interactive Knowledge-Based Indexing: The MedIndEx Prototype [O]. Susanne M. Humphrey. 1989
7. On hardness of jumbled indexing [O]. Amihood Amir, Timothy M. Chan, Moshe Lewenstein. 2016
