首页> 外文会议>IEEE International Congress on Big Data >Two-mode data distribution scheme for heterogeneous storage in data centers
【24h】

Two-mode data distribution scheme for heterogeneous storage in data centers

机译:数据中心异构存储的双模数据分配方案

获取原文

摘要

Fast growing "Big Data" demands present new challenges to the traditional distributed storage system solutions. In order to support cloud-scale data centers, new types of distributed storage systems are emerging. They are designed to scale to thousands of nodes, maintain petabytes of data and be highly reliable. The support for virtual machines is also becoming essential as it is one of the most important technology that supports cloud computing. To meet these needs, these distributed storage systems are implemented with advanced data distribution schemes. Data are striped and distributed across the storage cluster based on distribution algorithms instead of mapping tables. The existing algorithms usually balance the data distribution across nodes proportional to their capacity. However, they overlook distinct performance characteristics across different nodes and devices in the emerging heterogeneous storage environment. We propose a two-mode data distribution scheme in this study to maximize the overall performance and keep data balanced across the storage cluster at the same time. The working principle of the two-mode data distribution scheme is provided. We also present a new data read and write strategy to work with the two-mode scheme. We evaluate the computation time for data distribution using two-mode scheme and analyze its implication on the overall IO performance. We expect significant performance improvement while it still needs more analytical and experimental evaluation to further examine the details.
机译:快速增长的“大数据”要求对传统分布式存储系统解决方案的新挑战带来了新的挑战。为了支持云级数据中心,新型的分布式存储系统正在出现。它们旨在缩放到数千个节点,维护数据的Petabytes并高度可靠。对虚拟机的支持也变得必不可少,因为它是支持云计算的最重要技术之一。为了满足这些需求,这些分布式存储系统通过高级数据分配方案实现。基于分发算法而不是映射表来横跨存储群集地分割和分发数据。现有算法通常平衡与其容量成比例的节点的数据分布。然而,它们在新出现的异构存储环境中俯瞰不同节点和设备的不同性能特征。我们在本研究中提出了一种双模式数据分配方案,以最大限度地提高整体性能,并同时保持数据平衡数据。提供了双模数据分布方案的工作原理。我们还提出了一种新的数据读写策略来使用双模式方案。我们使用双模方案评估数据分布的计算时间,并分析其对整体IO性能的含义。我们预计仍然需要更大的性能改善,而仍需要更多的分析和实验评估,以进一步检查细节。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号