首页> 外文会议>International Conference on Future Computer and Communication;ICFCC >Efficient Maintenance Scheme of Inverted Index for Large-scale Full-Text Retrieval
【24h】

Efficient Maintenance Scheme of Inverted Index for Large-scale Full-Text Retrieval

机译:大规模全文检索的倒排索引高效维护方案

获取原文
获取外文期刊封面目录资料

摘要

Inverted index is the mainstay of modern full-text retrieval systems, and it is a promising way to improve time and space efficiencies with appropriately maintenance scheme of inverted files for huge amount of information management and retrieval. In order to improve the retrieval performance of inverted index in large-scale fulltext systems, a time and space efficient random access blocked inverted index (RABI) and an efficient dynamic maintenance scheme (DMS) are proposed in this paper. RABI divides inverted list into blocks and compresses different part of each block with the corresponding compression method to decrease space consumption. Based on RABI, DMS distinguishes between long and short posting lists. Then short posting lists are updated by remerge strategy and long posting lists are updated by hybrid in-place and remerge strategy. Experimental results show that, compared with existed schemes, the proposed scheme greatly averagely reduces space cost, conjunctive Boolean query time, and the cost of on-line index construction.
机译:倒排索引是现代全文检索系统的主体,它是一种通过适当维护倒排文件维护方案来提高海量时空效率的有前途的方式,可以进行大量的信息管理和检索。为了提高大规模全文系统中倒排索引的检索性能,提出了一种时空高效的随机访问阻塞倒排索引(RABI)和有效的动态维护方案(DMS)。 RABI将倒排列表分成多个块,并使用相应的压缩方法压缩每个块的不同部分,以减少空间消耗。基于RABI,DMS可以区分长发清单和短发清单。然后,通过重新合并策略更新简短的发布列表,并通过就地和重新合并混合策略更新长的发布列表。实验结果表明,与现有方案相比,该方案平均降低了空间成本,联合布尔查询时间和在线索引构建成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号