首页> 外文会议>Workshop on big data benchmarking >A Multidimensional OLAP Engine Implementation in Key-Value Database Systems
【24h】

A Multidimensional OLAP Engine Implementation in Key-Value Database Systems

机译:键值数据库系统中的多维OLAP引擎实现

获取原文

摘要

This paper tries to explore the capabilities of MapReduce-like execution engines for multidimensional data analytics through implementing a Multidimensional Online Analytical Processing (MOLAP) engine with cube model on the Hadoop ecosystem. The cube storage module converts dimension members into binary keys and leverages a novel distributed database to provide efficient storage for huge cuboids. The bit encoding sparse index is used to compress the cube data and the dimension bit encoding key with maximum members is used to achieve cube data sharding and distributed aggregation computing. We discuss how to match the star schema with cube model databases, and how to use resilient distributed data-sets for MDX-like queries executing on key-value systems. Finally, some queries of TPC-DS benchmark are adopted to validate the prototype implementation of the MOLAP engine. The results indicate that designed scenarios based on TPC-DS are suitable for various big data analytics operation benchmarking.
机译:本文试图通过在Hadoop生态系统上实现具有多维数据集模型的多维在线分析处理(MOLAP)引擎,探索类似MapReduce的执行引擎用于多维数据分析的功能。多维数据集存储模块将维成员转换为二进制密钥,并利用新颖的分布式数据库为大型立方体提供有效的存储。使用位编码稀疏索引来压缩多维数据集数据,使用具有最大成员数的维位编码密钥来实现多维数据集数据分片和分布式聚合计算。我们讨论如何将星型架构与多维数据集模型数据库匹配,以及如何对在键值系统上执行的MDX式查询使用弹性分布式数据集。最后,通过对TPC-DS基准测试的一些查询来验证MOLAP引擎的原型实现。结果表明,基于TPC-DS设计的方案适用于各种大数据分析操作基准测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号