A Tree-based Inverted File for Fast Ranked-Document Retrieval

Abstract

Inverted files are widely used to index documents in large-scale information retrieval systems. An inverted file consists of posting lists, which can be stored in either ascending order of document identifier or descending order of document weight. For an identifier-ascending posting list, retrieving ranked documents requires traversing all postings, whereas for a weight-descending posting list, evaluating Boolean queries involves very complex processing. In this paper, we transform a posting list into a tree-based structure, called the n-key-heap posting tree, to speed up ranked-document retrieval for Boolean queries. This structure preserves the order of document identifiers and the order of document weights simultaneously. To preserve the identifier order, the edge pointers are designed to maintain numerical order in the posting tree. To preserve the weight order, postings with greater weights are stored in higher tree nodes, following the heap property. We model these criteria as a tree-construction problem and propose an efficient algorithm that constructs an optimal posting tree with minimal access time.
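
The two ordering constraints described in the abstract can be illustrated with a small sketch. This is not the paper's n-key-heap posting tree or its optimal construction algorithm: it assumes one posting per node (a Cartesian tree, the same shape as a treap), and the names `Node`, `build_posting_tree`, `top_k`, and `contains` are hypothetical. It only shows how a single tree can keep identifier order (binary-search-tree order on `doc_id`) and weight order (max-heap on `weight`) at the same time.

```python
import heapq
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    doc_id: int
    weight: float
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_posting_tree(postings: List[Tuple[int, float]]) -> Optional[Node]:
    """Build a one-key-per-node tree from postings sorted by ascending doc_id.

    Assumption: a plain Cartesian tree, not the paper's n-key-heap layout.
    Identifier order is kept as BST order; weight order as a max-heap.
    """
    stack: List[Node] = []  # right spine of the tree built so far
    for doc_id, weight in postings:
        node = Node(doc_id, weight)
        last = None
        # Lighter nodes on the spine sink below the new, heavier node.
        while stack and stack[-1].weight < weight:
            last = stack.pop()
        node.left = last
        if stack:
            stack[-1].right = node
        stack.append(node)
    return stack[0] if stack else None


def top_k(root: Optional[Node], k: int) -> List[Tuple[int, float]]:
    """Return the k heaviest postings, expanding only the nodes needed."""
    result: List[Tuple[int, float]] = []
    frontier = [(-root.weight, root.doc_id, root)] if root else []
    while frontier and len(result) < k:
        _, _, node = heapq.heappop(frontier)
        result.append((node.doc_id, node.weight))
        for child in (node.left, node.right):
            if child is not None:
                heapq.heappush(frontier, (-child.weight, child.doc_id, child))
    return result


def contains(root: Optional[Node], doc_id: int) -> bool:
    """Look up a document identifier using the identifier (BST) order."""
    node = root
    while node is not None:
        if doc_id == node.doc_id:
            return True
        node = node.left if doc_id < node.doc_id else node.right
    return False


postings = [(1, 0.2), (3, 0.9), (4, 0.5), (7, 0.7), (9, 0.1)]
root = build_posting_tree(postings)
print(top_k(root, 3))     # heaviest postings first
print(contains(root, 4))  # identifier lookup, as a Boolean query needs
```

Because heavier postings sit nearer the root, `top_k` stops after expanding a few nodes instead of scanning the whole list, while `contains` still supports the identifier comparisons that Boolean operators rely on. Lookup cost in this simplified version depends on the tree's depth, which is not balanced here.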

Bibliographic details

Source: International Conference on Information and Knowledge Engineering, 2003, 6 pages
Authors: Wann-Yun Shieh; Tien-Fu Chen; Chung-Ping Chung
Original format: PDF
Chinese Library Classification: Information and communication theory
Keywords: information retrieval; inverted file; boolean query; ranked document; posting tree

Similar documents

1. Fast query evaluation through document identifier assignment for inverted file-based information retrieval systems [J]. Cher-Sheng Cheng, Chung-Ping Chung, Jean Jyh-Jiun Shann. Information Processing & Management, 2006, No. 3.

2. An Inverted File Cache for Fast Information Retrieval [J]. Wann-Yun Shieh, Jean Jyh-Jiun Shann, Chung-Ping Chung. Journal of Information Science and Engineering, 2003, No. 4.

3. Structural image retrieval using automatic image annotation and region based inverted file [J]. Dengsheng Zhang, Md. Monirul Islam, Guojun Lu. Journal of Visual Communication & Image Representation, 2013, No. 7.

4. A Tree-based Inverted File for Fast Ranked-Document Retrieval [C]. Wann-Yun Shieh, Tien-Fu Chen, Chung-Ping Chung. Proceedings of the International Conference on Information and Knowledge Engineering (IKE'03), 2003.

5. A List Merging Processor for Inverted File Information Retrieval Systems [D]. Lee Allen Hollaar. 1975.

6. Tabix: fast retrieval of sequence features from generic TAB-delimited files [O]. Heng Li.

7. Pengembangan Algoritma Fast Inversion dalam Membentuk Inverted File untuk Text Retrieval dengan Data Skala Besar (Development of the Fast Inversion algorithm for building inverted files for text retrieval on large-scale data) [O]. Derwin Suhartono. 2012.

8. Inverted File Tree Machine: Efficient Multi-Key Retrieval for VLSI (Very Large Scale Integration) [R]. H. P. Kriegel, R. Mannss, M. Overmars. 1985.