首页> 外文会议>String Processing and Information Retrieval; Lecture Notes in Computer Science; 4209 >Structured Index Organizations for High-Throughput Text Querying
【24h】

Structured Index Organizations for High-Throughput Text Querying

机译:高通量文本查询的结构化索引组织

获取原文
获取原文并翻译 | 示例

摘要

Inverted indexes are the preferred mechanism for supporting content-based queries in text retrieval systems, with the various data items usually stored compressed in some way. But different query modalities require that different information be held in the index. For example, phrase querying requires that word offsets be held as well as document numbers. In this study we describe an inverted index organization that provides efficient support for all of conjunctive Boolean queries, ranked queries, and phrase queries. Experimental results on a 426 GB document collection show that the methods we describe provide fast evaluation of all three querying modes.
机译:反向索引是在文本检索系统中支持基于内容的查询的首选机制,通常以某种方式压缩存储的各种数据项。但是不同的查询方式要求在索引中保存不同的信息。例如,短语查询要求保留单词偏移量以及文档编号。在这项研究中,我们描述了一个倒排索引组织,该组织为所有联合布尔查询,排名查询和短语查询提供有效的支持。在426 GB的文档集上的实验结果表明,我们描述的方法可以对所有三种查询模式进行快速评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号