Conference: International Conference on Database Systems for Advanced Applications

Physical Column Organization in In-Memory Column Stores


Abstract

Cost models are an essential part of database systems, as they are the basis of query performance optimization. Disk-based systems are well understood, and sophisticated models exist to compare various data structures and to estimate query costs based on disk IO operations. Cost models for in-memory databases shift the focus from disk IOs to main memory accesses and CPU costs. However, modeling memory accesses is fundamentally different from modeling disk IO, and the common disk-oriented models no longer apply. In this work, we examine four plan operations in in-memory column stores (scan with equality selection, scan with range selection, positional lookup, and insert) with respect to different physical column organizations. We consider uncompressed columns as well as bit-compressed and dictionary-encoded columns with sorted and unsorted dictionaries. Furthermore, we discuss tree indices on columns and dictionaries and present a detailed parameter evaluation, considering the number of distinct values, value skewness, and value disorder. Finally, we present and evaluate a cost model based on cache misses for estimating the runtime of the discussed plan operations.
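To make the four plan operations concrete, the following is a minimal Python sketch of one of the column organizations named in the abstract, a dictionary-encoded column with a sorted dictionary. It is an illustration under stated assumptions, not the implementation evaluated in the paper: the class name `SortedDictColumn` and its method names are hypothetical, value ids are kept in a plain Python list rather than a bit-compressed vector, and no tree index is used.

```python
from bisect import bisect_left, bisect_right


class SortedDictColumn:
    """Illustrative dictionary-encoded column with a sorted dictionary.

    Hypothetical names throughout; not taken from the paper.
    """

    def __init__(self, values):
        # Sorted list of distinct values.
        self.dictionary = sorted(set(values))
        index = {v: i for i, v in enumerate(self.dictionary)}
        # One value id per row, pointing into the dictionary.
        self.value_ids = [index[v] for v in values]

    def scan_equal(self, value):
        """Return row positions whose value equals `value`."""
        i = bisect_left(self.dictionary, value)  # binary search in dictionary
        if i == len(self.dictionary) or self.dictionary[i] != value:
            return []
        return [pos for pos, vid in enumerate(self.value_ids) if vid == i]

    def scan_range(self, low, high):
        """Return row positions with low <= value <= high."""
        lo = bisect_left(self.dictionary, low)
        hi = bisect_right(self.dictionary, high)
        # A sorted dictionary lets the scan compare value ids only.
        return [pos for pos, vid in enumerate(self.value_ids) if lo <= vid < hi]

    def lookup(self, position):
        """Positional lookup: decode the value stored at a row position."""
        return self.dictionary[self.value_ids[position]]

    def insert(self, value):
        """Append a row; a new distinct value shifts the larger value ids."""
        i = bisect_left(self.dictionary, value)
        if i == len(self.dictionary) or self.dictionary[i] != value:
            self.dictionary.insert(i, value)
            # Keeping the dictionary sorted forces a rewrite of existing ids.
            self.value_ids = [v + 1 if v >= i else v for v in self.value_ids]
        self.value_ids.append(i)


col = SortedDictColumn(["b", "a", "c", "a"])
equal_hits = col.scan_equal("a")
range_hits = col.scan_range("a", "b")
col.insert("aa")
```

The sketch shows the trade-off the abstract alludes to: with a sorted dictionary, range scans reduce to integer comparisons on value ids, while inserting a previously unseen value may require rewriting the existing ids. An unsorted dictionary would reverse that trade-off.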
