首页> 外文学位 >Data processing using flash storage: Some opportunities and limitations.
【24h】

Data processing using flash storage: Some opportunities and limitations.

机译:使用闪存存储的数据处理:一些机会和局限性。

获取原文
获取原文并翻译 | 示例

摘要

In many data intensive workloads, I/O is a key bottleneck. In a storage hierarchy in a canonical database system, non-volatile storage devices (e.g., hard disk drives and flash solid state drives) are used as permanent data storage subsystems, whereas volatile storage devices (e.g., DRAM and CPU registers) are used to stage data from the non-volatile storage for processing by the CPU. Under this hardware architecture, non-volatile storage is connected to the rest of the system via common host I/O interfaces, such as SAS, SATA, and PCIe. Data movement cost through these I/O interfaces has become the largest performance bottleneck for many data intensive workloads. Therefore, in this thesis we explore alternative solutions to achieve high performance data processing by reducing the (expensive) data movement cost across the I/O interfaces.;In the first and second parts of this thesis, we propose a "code push down" technology to reduce the data movement cost from flash solid state drives (SSDs) to DRAM. We use the computation capability of the SSD device to push down selected database operations into the SSD devices, thereby dramatically reducing the actual data movement cost through host I/O interfaces.;Another alternative solution that we propose in the third part of this thesis is to preload necessary data (i.e., hot data) from disks to DRAM before the user query actually requests the data. In this part of thesis, we focus on how to load hot data efficiently at system restart, which could save the data movement cost at query time.;Collectively this thesis discusses some opportunities for using SSDs in data processing platforms, and develops insights about the current limitations and potential future opportunities for using the computational processing power inside SSDs to alleviate the I/O bottleneck in data intensive workloads.
机译:在许多数据密集型工作负载中,I / O是关键瓶颈。在规范数据库系统的存储层次结构中,非易失性存储设备(例如硬盘驱动器和闪存固态驱动器)用作永久数据存储子系统,而易失性存储设备(例如DRAM和CPU寄存器)用于从非易失性存储器中暂存数据以供CPU处理。在这种硬件架构下,非易失性存储通过通用主机I / O接口(例如SAS,SATA和PCIe)连接到系统的其余部分。通过这些I / O接口的数据移动成本已成为许多数据密集型工作负载的最大性能瓶颈。因此,在本文中,我们探索了通过降低跨I / O接口的(昂贵的)数据移动成本来实现高性能数据处理的替代解决方案。在本文的第一部分和第二部分,我们提出了“下推代码”减少从闪存固态驱动器(SSD)到DRAM的数据移动成本的技术。我们利用SSD设备的计算能力将选定的数据库操作下推到SSD设备中,从而显着降低了通过主机I / O接口进行数据移动的实际成本。本文第三部分提出的另一种替代解决方案是在用户查询实际请求数据之前,将必要的数据(即热数据)从磁盘预加载到DRAM。在本部分中,我们着重于如何在系统重新启动时有效地加载热数据,从而节省查询时的数据移动成本。;本文共同探讨了在数据处理平台中使用SSD的一些机会,并就此提出了见解。使用SSD内部的计算处理能力来缓解数据密集型工作负载中的I / O瓶颈的当前限制和潜在的未来机会。

著录项

  • 作者

    Park, Kwanghyun.;

  • 作者单位

    The University of Wisconsin - Madison.;

  • 授予单位 The University of Wisconsin - Madison.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2016
  • 页码 107 p.
  • 总页数 107
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号