Towards Batch-Processing on Cold Storage Devices

机译：在冷库设备上批量处理

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Large amounts of data in storage systems is cold, i.e., Written Once and Read Occasionally (WORO). The rapid growth of massive-scale archival and historical data increases the demand for petabyte-scale cheap storage for such cold data. A Cold Storage Device (CSD) is a disk-based storage system which is designed to trade off performance for cost and power efficiency. Inevitably, the design restrictions used in CSD's results in performance limitations. These limitations are not a concern for WORO workloads, however, the very low price/performance characteristics of CSDs makes them interesting for other applications, e.g., batch processes, too. Applications, however, can be very slow on CSD's if they do not take their characteristics into account. In this paper we design two strategies for data partitioning in CSDs -- a crucial operation in many batch analytics tasks like hash-join, near-duplicate detection, and data localization. We show that our strategies can efficiently use CSDs for batch processing of terabyte-scale data by accelerating data partitioning by 3.5x in our experiments.

机译：存储系统中的大量数据是冷的，即，写一次并偶尔读取（Woro）。大规模档案档案和历史数据的快速增长会增加对这种冷数据的Petabyte规模廉价存储的需求。冷库设备（CSD）是基于磁盘的存储系统，该存储系统旨在为成本和功率效率进行衡量性能。不可避免地，CSD中使用的设计限制导致性能限制。这些限制不是对Woro工作负载的关注，但是，CSD的非常低的价格/性能特征使得它们对于其他应用程序也有趣，例如批处理过程。但是，如果他们没有考虑到他们的特征，则可以对CSD进行非常慢的应用程序。在本文中，我们设计了两个用于CSD中的数据分区的策略 - 许多批次分析任务中的重要操作，如Hash-Join，近重复检测和数据定位。我们表明我们的策略可以通过在我们的实验中加速数据分区3.5倍来有效地使用CSD进行批量处理Tberbyte-Scale数据。

著录项

来源
《IEEE International Conference on Data Engineering Workshops》|2018年|176p|共6页
会议地点
作者
Ali Hadian; Thomas Heinis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274-53;
关键词
Switches; Task analysis; Data processing; Batch production systems; Databases; Random access memory; Performance evaluation;

机译：交换机;任务分析;数据处理;批量生产系统;数据库;随机存取存储器;性能评估;

相似文献

外文文献
中文文献
专利

1. Heart transplantation of patients with ventricular assist devices: impact of normothermic ex-vivo preservation using organ care system compared with cold storage [J] . Rymbay Kaliyev, Timur Lesbekov, Serik Bekbossynov, Journal of Cardiothoracic Surgery . 2020,第1期

机译：心室辅助装置患者的心脏移植：使用器官护理系统对室温前体内保存的影响与冷藏系统相比
2. Cold engine cranking by means of modern energy storage devices - physical simulation [J] . Alexey B. Tarasenko, Tatiana S. Gabderakhmanova, Sophia V. Kiseleva, MATEC Web of Conferences . 2018,第3期

机译：通过现代储能设备启动冷发动机-物理模拟
3. Theoretical Analysis of Cold Storage Devices in a CO_2 Transcritical/Subcritical Supermarket Refrigeration Plant [J] . Claudio Ferrandi, Maurizio Orlandi Journal of energy and power engineering . 2013,第1期

机译：CO_2跨临界/亚临界超市制冷厂冷库设备的理论分析
4. Towards Batch-Processing on Cold Storage Devices [C] . Ali Hadian, Thomas Heinis 2018 IEEE 34th International Conference on Data Engineering Workshops . 2018

机译：走向冷藏设备上的批处理
5. Hot and Cold Data Identification: Applications to Storage Devices and Systems. [D] . Park, Dongchul. 2012

机译：冷热数据识别：在存储设备和系统中的应用。
6. Heart transplantation of patients with ventricular assist devices: impact of normothermic ex-vivo preservation using organ care system compared with cold storage [O] . Rymbay Kaliyev, Timur Lesbekov, Serik Bekbossynov, 2020

机译：心室辅助装置患者的心脏移植：使用器官护理系统对室温前体内保存的影响与冷藏系统相比
7. Heart transplantation of patients with ventricular assist devices: impact of normothermic ex-vivo preservation using organ care system compared with cold storage [O] . Rymbay Kaliyev, Timur Lesbekov, Serik Bekbossynov, 2020

机译：心室辅助装置患者的心脏移植：使用器官护理系统对室温前体内保存的影响与冷藏系统相比

Towards Batch-Processing on Cold Storage Devices

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅