首页> 外文OA文献 >Design and implementation of simple storage and query engines for a column-based and access optimized relational table structure on disk
【2h】

Design and implementation of simple storage and query engines for a column-based and access optimized relational table structure on disk

机译:为磁盘上基于列和访问优化的关系表结构设计和实现简单的存储和查询引擎

摘要

Relational database management systems currently keep large volumes of data in secondary storage. But with the increasing development of hardware features such as quantity of memory, number of CPUs and CPUs cores, computation cycles, etc. the new trend is to store data in main memory. Such storage will lead to a suited different organization of data that only is efficient if data is always available in main memory. Taking into account this idea, SAP has developed a new relational DBMS: SAP HANA. It is an implementation of in-memory database technology designed for enterprise computing. In this database it is necessary to separate data into two differenced categories: cold data and hot data. This separation is necessary to handle the increasing, but still limited capacity of main memory efficiently while keeping all data available for reporting needs. Data in a database has its own life cycle and the sequence of events that certain data has passed through determines its state. The term cold data refers to data that has become passive and can be stored in disk if it will not be changed any longer and, thus, this data will be accessed less often. In the other hand, the term hot data is used to refer data that is frequently accessed. The disadvantage with in-memory databases is the memory consumption especially for data which are rarely accessed, i.e. cold data. Therefore some mechanisms of streaming cold data in/out disk without using any kind of buffering, which allows consuming the minimal amount of memory, have been designed, implemented and evaluated. According to experimental results, the approach that consists of using a column store data model and applying some compression techniques show the best performance. ___________________________________________________________________________________________________________________________________
机译:关系数据库管理系统当前将大量数据保留在辅助存储中。但是随着硬件功能(例如内存数量,CPU和CPU内核数量,计算周期等)的不断发展,新的趋势是将数据存储在主内存中。这种存储将导致合适的不同数据组织,只有在主存储器中始终有可用数据时,这种存储才有效。考虑到这一想法,SAP开发了一种新的关系DBMS:SAP HANA。它是专为企业计算设计的内存数据库技术的实现。在此数据库中,有必要将数据分为两个不同的类别:冷数据和热数据。这种分离对于有效地处理不断增加但仍然有限的主内存容量,同时保持所有数据可用于报告需求是必需的。数据库中的数据具有其自身的生命周期,某些数据经过的事件顺序决定了其状态。术语“冷数据”指的是已变得被动的数据,如果不再进行更改,则可以将其存储在磁盘中,因此,将减少对这些数据的访问。另一方面,术语“热数据”用于表示经常访问的数据。内存数据库的缺点是特别是对于很少访问的数据(即冷数据)的内存消耗。因此,已经设计,实现和评估了一些不使用任何类型的缓冲就可以在不使用任何类型的缓冲的情况下将冷数据流入/流出磁盘的机制。根据实验结果,由使用列存储数据模型和应用一些压缩技术组成的方法显示出最佳性能。 ___________________________________________________________________________________________________________________________________

著录项

  • 作者

    Prieto Horcajo Lorena;

  • 作者单位
  • 年度 2013
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号