首页> 外文会议>IEEE International Conference on Networking, Architecture and Storage >Workload Shifting: Contention-Insular Disk Arrays for Big Data Systems
【24h】

Workload Shifting: Contention-Insular Disk Arrays for Big Data Systems

机译:工作负载转移:适用于大数据系统的竞争性独立磁盘阵列

获取原文

摘要

It is well known that in-place update index, unordered log structured index and ordered log structured index are three typical data organizations which are designed to meet different workload requirements respectively and wildly used in big data storage systems. Differentiated workload requirements in different phase of the data lifecycle, e.g. various types of data are injected into the big data storage systems in the write optimized manner, then they are needed to be read in the read optimized manner for analysis, lead to data organization transformation(data transformation for short). However, the simple mixture of foreground data injection and background data transformation causes serious disk contention. Frequent disk head seeks result in low disk throughput, and not only prolong the data transformation process, but also increase foreground data injection latency. In this paper, we propose emph{Workload Shifting}, a novel log- structured design that shifts background data transformation away from the foreground data injection. Compared with conventional RAID0 disk array, emph{Workload Shifting} effectively isolates background data transformation and foreground data injections, avoids the disk contention between them to boost their performance. We have implemented emph{Workload Shifting} prototype on one multiple disks based disk array. Extensive experimental evaluation results show that compared with conventional RAID0 disk arrays, emph{Workload Shifting} can avoid disk contention and speed up both data injection and data transformation significantly.
机译:众所周知,就地更新索引,无序日志结构化索引和有序日志结构化索引是三种典型的数据组织,它们分别设计用于满足不同的工作负载要求,并广泛用于大数据存储系统中。数据生命周期不同阶段的差异化工作负载需求各种类型的数据以写优化方式注入到大数据存储系统中,然后需要以读优化方式进行读取以进行分析,从而导致数据组织转换(简称数据转换)。但是,前台数据注入和后台数据转换的简单混合会导致严重的磁盘争用。频繁的磁盘头搜索导致磁盘吞吐量降低,不仅延长了数据转换过程,而且还增加了前台数据注入延迟。在本文中,我们提出了\ emph {Workload Shifting},这是一种新颖的日志结构设计,可将背景数据转换从前台数据注入中转移出来。与传统的RAID0磁盘阵列相比,\ emph {Workload Shifting}有效地隔离了后台数据转换和前景数据注入,避免了它们之间的磁盘争用以提高其性能。我们已经在一个基于多个磁盘的磁盘阵列上实现了\ emph {Workload Shifting}原型。大量的实验评估结果表明,与常规RAID0磁盘阵列相比,\ emph {Workload Shifting}可以避免磁盘争用并显着加快数据注入和数据转换的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号