首页> 外文OA文献 >Bit-Sliced Signature Files for Very Large Text Databases on a Parallel Machine Architecture
【2h】

Bit-Sliced Signature Files for Very Large Text Databases on a Parallel Machine Architecture

机译:并行计算机体系结构上的超大型文本数据库的位切片签名文件

摘要

Free text retrieval is an important problem which can significantly benefit from a parallel architecture. Signature methods have been proposed to answer text retrieval queries in parallel machines [Sta88, LF92], under the assumption that the main memory is sufficient to hold the entire signature file. We propose the use of a Parallel Bit-Sliced Signature File method on a SIMD machine architecture when the size of the signature file exceeds the available memory. We propose that we need not examine all the bit slices; instead we use a partial fetch slice swapping algorithm. This method achieves graceful performance degradation according to the database size. We provide formulae for the optimal number of signature slices to fetch and match with the query signature. Arithmetic examples show that our method can handle a 128GB database with a 2sec response time on a machine with the characteristics of the Connection Machine.
机译:自由文本检索是一个重要的问题,可以从并行体系结构中受益匪浅。在主存储器足以容纳整个签名文件的假设下,已经提出了签名方法来回答并行计算机中的文本检索查询[Sta88,LF92]。当签名文件的大小超过可用内存时,我们建议在SIMD机器体系结构上使用并行位切片签名文件方法。我们建议我们不必检查所有的位片。相反,我们使用部分获取切片交换算法。根据数据库的大小,此方法会导致性能下降。我们提供了用于获取和与查询签名匹配的最佳签名切片数量的公式。算术示例表明,我们的方法可以在具有Connection Machine特性的计算机上处​​理具有2秒响应时间的128GB数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号