首页> 外文OA文献 >Fast forward index methods for pseudo-relevance feedback retrieval
【2h】

Fast forward index methods for pseudo-relevance feedback retrieval

机译:伪相关反馈检索的快进索引方法

摘要

The inverted index is the dominant indexing method in information retrieval systems. It enables fast return of the list of all documents containing a given query term. However, for retrieval schemes involving query expansion, as in pseudo-relevance feedback (PRF), the retrieval time based on an inverted index increases linearly with the number of expansion terms. In this regard, we have examined the use of a forward index, which consists of the mapping of each document to its constituent terms. We propose a novel forward index-based reranking scheme to shorten the PRF retrieval time. In our method, a first retrieval of the original query is performed using an inverted index, and then a forward index is employed for the PRF part. We have studied several new forward indexes, including using a novel spstring data structure and the weighted variable bit-block compression (wvbc) signature. With modern hardware such as solid-state drives (SSDs) and sufficiently large main memory, forward index methods are particularly promising. We find that with the whole index stored in main memory, PRF retrieval using a spstring or wvbc forward index excels in time efficiency over an inverted index, being able to obtain the same levels of performance measures at shorter times.
机译:倒排索引是信息检索系统中的主要索引方法。它可以快速返回包含给定查询词的所有文档的列表。但是,对于涉及查询扩展的检索方案,例如在伪相关反馈(PRF)中,基于反向索引的检索时间随扩展项的数量线性增加。在这方面,我们检查了前向索引的使用,该索引由每个文档与其组成术语的映射组成。我们提出了一种新颖的基于前向索引的重排序方案,以缩短PRF检索时间。在我们的方法中,首先使用反向索引对原始查询进行检索,然后将前向索引用于PRF部分。我们研究了几种新的前向索引,包括使用新颖的spstring数据结构和加权可变位块压缩(wvbc)签名。使用诸如固态驱动器(SSD)和足够大的主存储器之类的现代硬件,前向索引方法特别有前途。我们发现,将整个索引存储在主存储器中,使用spstring或wvbc正向索引进行PRF检索比反向索引在时间效率方面表现出色,能够在更短的时间内获得相同级别的性能指标。

著录项

  • 作者

    Dang EKF; Luk RWP; Allan J;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号