
Compressing relations and indexes

Abstract

We propose a new compression algorithm that is tailored to database applications. It can be applied to a collection of records, and is especially effective for records with many low to medium cardinality fields and numeric fields. In addition, this new technique supports very fast decompression. Promising application domains include decision support systems (DSS), since fact tables, which are by far the largest tables in these applications, contain many low and medium cardinality fields and typically no text fields. Further, our decompression rates are faster than typical disk throughputs for sequential scans; in contrast, gzip is slower. This is important in DSS applications, which often scan large ranges of records. An important distinguishing characteristic of our algorithm, in contrast to compression algorithms proposed earlier, is that we can decompress individual tuples (even individual fields), rather than a full page (or an entire relation) at a time. Also, all the information needed for tuple decompression resides on the same page with the tuple. This means that a page can be stored in the buffer pool and used in compressed form, simplifying the job of the buffer manager and improving memory utilization. Our compression algorithm also improves index structures such as B-trees and R-trees significantly by reducing the number of leaf pages and compressing index entries, which greatly increases the fan-out. We can also use lossy compression on the internal nodes of an index.
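
To make the tuple-level and field-level decompression property concrete, here is a minimal sketch. It assumes integer-valued fields and a generic page-local offset encoding: each field is stored as its distance from the smallest value of that field on the page, packed into just enough bits. The abstract does not specify the algorithm, so this is an illustration of the stated properties, not the authors' method; the names `CompressedPage`, `compress_page`, `read_field`, and `read_tuple` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class CompressedPage:
    """Hypothetical page layout: all decoding metadata lives on the page itself."""
    mins: List[int]    # per-field minimum value on this page
    widths: List[int]  # bits used per field to store (value - minimum)
    rows: List[int]    # each tuple packed into a single integer


def compress_page(tuples: Sequence[Tuple[int, ...]]) -> CompressedPage:
    """Pack every tuple as per-field offsets from the page-local minimums."""
    n_fields = len(tuples[0])
    mins = [min(t[i] for t in tuples) for i in range(n_fields)]
    maxs = [max(t[i] for t in tuples) for i in range(n_fields)]
    # A field whose values are all equal on this page needs zero bits.
    widths = [(hi - lo).bit_length() for lo, hi in zip(mins, maxs)]
    rows = []
    for t in tuples:
        packed = 0
        for value, lo, width in zip(t, mins, widths):
            packed = (packed << width) | (value - lo)
        rows.append(packed)
    return CompressedPage(mins, widths, rows)


def read_field(page: CompressedPage, row: int, field: int) -> int:
    """Decode one field of one tuple without touching any other tuple."""
    shift = sum(page.widths[field + 1:])   # bits occupied by the later fields
    mask = (1 << page.widths[field]) - 1
    return page.mins[field] + ((page.rows[row] >> shift) & mask)


def read_tuple(page: CompressedPage, row: int) -> Tuple[int, ...]:
    """Decode one full tuple using only this page's metadata."""
    return tuple(read_field(page, row, f) for f in range(len(page.widths)))


# Toy fact-table rows: (year, month, quantity), all low or medium cardinality.
page = compress_page([(1998, 3, 100), (1998, 7, 250), (1999, 3, 120)])
print(page.widths)
print(read_field(page, 1, 2))
print(read_tuple(page, 2))
```

Because `mins` and `widths` travel with the page, a buffer manager could in principle keep the page compressed in memory and decode only the fields a query touches, which is the behaviour the abstract describes.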
