...
首页> 外文期刊>International Journal of Information and Communication Technology Research >Compression of High-dimensional Data Spaces Using Non-differential Augmented Vector Quantization
【24h】

Compression of High-dimensional Data Spaces Using Non-differential Augmented Vector Quantization

机译:使用非差分增强矢量量化的高维数据空间压缩

获取原文
           

摘要

Most data-intensive applications are confronted with the problems of I/O bottleneck, poor query processing times and space requirements. Database compression alleviates this bottleneck, reduces disk space usage, improves disk access speed, speeds up query response time, reduces overall retrieval time and increases the effective I/O bandwidth. However, random access to individual tuples in a compressed database is very difficult to achieve with most of the available compression techniques. This paper reports a lossless compression technique called non-differential augmented vector quantization. The technique is applicable to a collection of tuples and especially effective for tuples with numerous low to medium cardinality fields. In addition, the technique supports standard database operations, permits very fast random access and atomic decompression of tuples in large collections. The technique maps a database relation into a static bitmap index cached access structure. Consequently, we were able to achieve substantial savings in space by storing each database tuple as a bit value in the computer memory. Important distinguishing characteristics of our technique are that tuples can be compressed and decompressed individually rather than a full page or entire relation at a time. Furthermore, the information needed for tuple compression and decompression can reside in the memory. Possible application domains of this technique include decision support systems, statistical and life databases with low cardinality fields and possibly no text fields.
机译:大多数数据密集型应用程序都面临I / O瓶颈,较差的查询处理时间和空间要求的问题。数据库压缩可缓解此瓶颈,减少磁盘空间使用,提高磁盘访问速度,加快查询响应时间,减少总体检索时间,并增加有效的I / O带宽。但是,使用大多数可用的压缩技术很难实现对压缩数据库中各个元组的随机访问。本文报道了一种称为无差增强矢量量化的无损压缩技术。该技术适用于元组的集合,特别适用于具有众多低到中基数字段的元组。另外,该技术支持标准数据库操作,允许非常快速的随机访问和对大型集合中元组的原子解压缩。该技术将数据库关系映射到静态位图索引缓存的访问结构中。因此,通过将每个数据库元组作为位值存储在计算机内存中,我们能够节省大量空间。我们的技术的重要区别特征是元组可以单独进行压缩和解压缩,而不是一次压缩整个页面或整个关系。此外,元组压缩和解压缩所需的信息可以驻留在内存中。该技术可能的应用领域包括决策支持系统,具有低基数字段且可能没有文本字段的统计和生命数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号