首页> 外文会议>International Conference on String Processing and Information Retrieval >Structured Index Organizations for High-Throughput Text Querying
【24h】

Structured Index Organizations for High-Throughput Text Querying

机译:用于高吞吐量文本查询的结构化索引组织

获取原文

摘要

Inverted indexes are the preferred mechanism for supporting content-based queries in text retrieval systems, with the various data items usually stored compressed in some way. But different query modalities require that different information be held in the index. For example, phrase querying requires that word offsets be held as well as document numbers. In this study we describe an inverted index organization that provides efficient support for all of conjunctive Boolean queries, ranked queries, and phrase queries. Experimental results on a 426 GB document collection show that the methods we describe provide fast evaluation of all three querying modes.
机译:反相索引是用于支持文本检索系统中基于内容的查询的优选机制,其中各种数据项通常以某种方式存储压缩。但不同的查询模式要求在索引中保持不同的信息。例如,短语查询要求将单词偏移和文档编号保持偏移。在这项研究中,我们描述了一个反转的索引组织,为所有联合布尔查询,排名查询和短语查询提供有效的支持。 426 GB文件收集的实验结果表明,我们描述的方法为所有三种查询模式提供了快速评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号