首页> 外文会议>International conference on Very Large Data Bases >Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files
【24h】

Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files

机译:使用压缩反转文件搜索局部指定的术语的大型词汇

获取原文

摘要

There are many advantages to be gained by storing the lexicon of a full text database in main memory. In this paper we describe how to use a compressed inverted file index to search such a lexicon for entries that match a pattern or partially specified term. This method provides an effective compromise between speed and space, running orders of magnitude faster than brute force search, but requiring less memory than other pattern-matching data structures; indeed, in some cases requiring less memory than would be consumed by a single pointer to each string. The pattern search method is based on text indexing techniques and is a successful adaptation of inverted files to main memory databases.
机译:通过将全文数据库的词典存储在主内存中,可以获得许多优点。在本文中,我们介绍如何使用压缩的反转文件索引来搜索与模式或部分指定术语匹配的条目的此类词汇。该方法在速度和空间之间提供有效的折衷,比蛮力搜索更快地运行数量级,但需要更少的内存而不是其他模式匹配数据结构;实际上,在某些情况下,需要较少的内存,而不是由每个字符串用单个指针消耗的内存。模式搜索方法基于文本索引技术,并且是将反转文件的成功适应主存储器数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号