...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >HM: A Column-Oriented MapReduce System on Hybrid Storage
【24h】

HM: A Column-Oriented MapReduce System on Hybrid Storage

机译:HM:混合存储上的面向列的MapReduce系统

获取原文
获取原文并翻译 | 示例

摘要

The solid-state hybrid drive (SSHD) incorporates a small NAND flash memory into a hard drive, resulting in an integrated device with combined Hard Disk Drive (HDD ) and Solid State Disk (SSD) storage. By identifying the data highly associated with the performance and buffering them in the SSD part, SSHD can deliver a better performance than the standard hard drive. However, that requires a significant redesign for existing data processing systems. In this paper, we examine the problem of efficiently processing relational data using MapReduce on a cluster using SSHDs as the underlying storage devices. We present the design of Hybrid MapReduce ( ), a column-oriented MapReduce system, which adopts different storage layout, query optimizer, data index, and compression algorithm from previous MapReduce systems. In , the Distributed File System (DFS ) is deployed on SSHDs, and data layout (how data chunks are disseminated to HDDs and SSDs) plays a key role for the performance. Hence, an approximate algorithm is used to tune the data layout adaptively to maximize the query performance. We evaluate using TPC-H benchmark and the results show that with our new design, the hybrid system can provide a similar performance as the SSD-only system.
机译:固态混合驱动器(SSHD)将小的NAND闪存集成到硬盘驱动器中,从而形成了具有组合的硬盘驱动器(HDD)和固态磁盘(SSD)存储的集成设备。通过识别与性能高度相关的数据并将其缓存在SSD部分中,SSHD可以提供比标准硬盘更好的性能。但是,这需要对现有的数据处理系统进行重大的重新设计。在本文中,我们研究了在使用SSHD作为底层存储设备的群集上使用MapReduce有效处理关系数据的问题。我们提出了面向列的MapReduce系统Hybrid MapReduce()的设计,该系统采用了与以前的MapReduce系统不同的存储布局,查询优化器,数据索引和压缩算法。在中,分布式文件系统(DFS)部署在SSHD上,并且数据布局(如何将数据块分发到HDD和SSD上)对性能起着关键作用。因此,可以使用一种近似算法来自适应地调整数据布局,以最大化查询性能。我们使用TPC-H基准进行评估,结果表明,通过我们的新设计,混合系统可以提供与仅SSD系统相似的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号