TB±tree: Index structure for Information Retrieval Systems


Abstract

Information retrieval (IR) systems use different indexing techniques to retrieve information, such as inverted files and signature files. Signature files are suited to small IR systems because of their slow response, while inverted files have better response times but a high space overhead. Moreover, inverted files use B±trees for single-word queries. This paper presents a new indexing structure, called the TB±tree, for use in the design of inverted files for large information retrieval systems. The TB±tree is a variant of the B±tree that supports single-keyword queries and phrase queries efficiently. The TB±tree algorithms represent each keyword stored in the index by a numeric value, which can also serve as a form of encryption and enforce security. The numeric value for each keyword is stored in binary format, which may reduce the size of the index file by 19%. Records in the TB±tree may be of variable length.
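
The abstract does not give the TB±tree algorithms themselves, so the following is only a minimal sketch of the general idea it describes: keywords replaced by numeric values, those values stored in binary, and postings that can answer both single-keyword and phrase queries. It uses a plain Python dictionary rather than a tree. The functions `build_index`, `pack_key`, and `phrase_query`, the 4-byte key width, and the rule for assigning numeric values (order of first appearance) are all assumptions made for illustration, not details from the paper.

```python
import struct


def build_index(docs):
    """Build a toy positional inverted index keyed by numeric keyword ids."""
    ids = {}        # keyword -> numeric id (assumed: order of first appearance)
    postings = {}   # numeric id -> list of (doc_id, position) pairs
    for doc_id, text in enumerate(docs):
        for pos, word in enumerate(text.lower().split()):
            key = ids.setdefault(word, len(ids))
            postings.setdefault(key, []).append((doc_id, pos))
    return ids, postings


def pack_key(key):
    """Encode a numeric id as a fixed 4-byte big-endian value (assumed width)."""
    return struct.pack(">I", key)


def phrase_query(ids, postings, phrase):
    """Return sorted ids of documents containing the words of `phrase` consecutively.

    A single-keyword query is the special case of a one-word phrase.
    """
    words = phrase.lower().split()
    if not words or any(w not in ids for w in words):
        return []
    # Start from every occurrence of the first word, then keep only those
    # occurrences that are followed by each later word at the right offset.
    candidates = set(postings[ids[words[0]]])
    for offset, word in enumerate(words[1:], start=1):
        following = set(postings[ids[word]])
        candidates = {(d, p) for (d, p) in candidates if (d, p + offset) in following}
    return sorted({d for d, _ in candidates})
```

In this sketch every keyword occupies four bytes in the index regardless of its length as a string, which is the kind of saving a binary numeric representation can offer; the 19% figure reported in the abstract comes from the paper's own design and is not reproduced here.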
