首页> 外文期刊>International Journal of Data Warehousing and Mining >Parallel Real-Time OLAP on Multi-Core Processors
【24h】

Parallel Real-Time OLAP on Multi-Core Processors

机译:多核处理器上的并行实时OLAP

获取原文
获取原文并翻译 | 示例
       

摘要

One of the most powerful and prominent technologies for knowledge discovery in decision support systems is online analytical processing (OLAP). Most of the traditional OLAP research, and most of the commercial systems, follow the static data cube approach proposed by Gray et.al. and materialize all or a subset of the cuboids of the data cube in order to ensure adequate query performance. Practitioners have called for some time for a real-time OLAP approach where the OLAP system gets updated instantaneously as new data arrives and always provides an up-to-date data warehouse for the decision support process. However, a major problem for real-time OLAP is the significant performance issues with large scale data warehouses. The aim of our research is to address these problems through the use of efficient parallel computing methods. In this paper, we present a parallel real-time OLAP system for multi-core processors. To our knowledge, this is the first real-time OLAP system that has been parallelized and optimized for contemporary multi-core architectures. Our system allows for multiple insert and multiple query transactions to be executed in parallel and in real-time. We evaluated our method for a multitude of scenarios (different ratios of insert and query transactions, query transactions with different amounts of data aggregation, different database sizes, etc.), using the TPCDS "Decision Support" benchmark data set. As multi-core test platforms, we used an Intel Sandy Bridge processor with 4 cores (8 hardware supported threads) and an Intel Xeon Westmere processor with 20 cores (40 hardware supported threads). The tests demonstrate that, with increasing number of processor cores, our parallel system achieves close to linear speedup in transaction response time and transaction throughput. On the 20 core architecture we achieved, for a 100 GB database, a better than 0.25 second query response time for real-time OLAP queries that aggregate 25% of the database. Since hardware performance improvements are currently, and in the foreseeable future, achieved not by faster processors but by increasing the number of processor cores, our new parallel real-time OLAP method has the potential to enable OLAP systems that operate in real-time on large databases.
机译:在线分析处理(OLAP)是决策支持系统中用于发现知识的最强大,最杰出的技术之一。大多数传统的OLAP研究和大多数商业系统都遵循Gray等人提出的静态数据立方体方法。并实例化数据立方体的所有或全部长方体,以确保足够的查询性能。从业人员要求使用实时OLAP方法一段时间,该方法会在新数据到达时立即更新OLAP系统,并始终为决策支持流程提供最新的数据仓库。但是,实时OLAP的主要问题是大型数据仓库的重大性能问题。我们研究的目的是通过使用有效的并行计算方法来解决这些问题。在本文中,我们提出了一种用于多核处理器的并行实时OLAP系统。据我们所知,这是第一个针对现代多核体系结构进行并行化和优化的实时OLAP系统。我们的系统允许并行和实时执行多个插入和多个查询事务。我们使用TPCDS“决策支持”基准数据集针对多种场景(插入和查询事务的不同比率,具有不同数据聚合量的查询事务,不同的数据库大小等)评估了我们的方法。作为多核测试平台,我们使用了具有4个核心(8个硬件支持的线程)的Intel Sandy Bridge处理器和具有20个核心(40个硬件支持的线程)的Intel Xeon Westmere处理器。测试表明,随着处理器内核数量的增加,我们的并行系统在事务响应时间和事务吞吐量方面实现了接近线性的加速。对于100 GB的数据库,在20个核心体系结构上,对于汇总25%的数据库的实时OLAP查询,查询响应时间要优于0.25秒。由于当前和可预见的将来,硬件性能的提高不是通过更快的处理器而是通过增加处理器内核的数量来实现的,因此我们新的并行实时OLAP方法有潜力使OLAP系统能够在大型设备上实时运行数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号