首页> 外文学位 >Distributed nosql storage for extreme-scale system services in clouds and supercomputers.
【24h】

Distributed nosql storage for extreme-scale system services in clouds and supercomputers.

机译:分布式nosql存储,用于云和超级计算机中的超大规模系统服务。

获取原文
获取原文并翻译 | 示例

摘要

As supercomputers gain more parallelism at exponential rates, the storage infrastructure performance is increasing at a significantly lower rate due to relatively centralized management. This implies that the data management and data flow between the storage and compute resources is becoming the new bottleneck for large-scale applications. Similarly, cloud based distributed systems introduce other challenges stemming from the dynamic nature of cloud applications. This dissertation addresses several challenges on storage systems at extreme scales for supercomputers and clouds by designing and implementing a zero-hop distributed NoSQL storage system (ZHT), which has been tuned for the requirements of high-end computing systems. ZHT aims to be a building block for scalable distributed systems. The goals of ZHT are delivering high availability, good fault tolerance, light-weight design, persistence, dynamic joins and leaves, high throughput, and low latencies, at extreme scales (millions of nodes). We have evaluated ZHT's performance under a variety of systems, ranging from a Linux cluster with 64-nodes, an Amazon EC2 virtual cluster up to 96-nodes, to an IBM Blue Gene/P supercomputer with 8K-nodes. This work also presents several real systems that have adopted ZHT as well as other NoSQL systems, namely ZHT/Q, FusionFS, IStore, MATRIX, Slurm++, Fabriq, FREIDA-State, and WaggleDB, all of these real systems have been significantly simplified due to NoSQL storage systems, and have been shown to outperform other leading systems by orders of magnitude in some cases. Through our work, we have shown how NoSQL storage systems can help on both performance and scalability at large scales in such a variety of environments.
机译:随着超级计算机以指数速率获得更多的并行性,由于相对集中的管理,存储基础架构性能以明显较低的速率提高。这意味着存储和计算资源之间的数据管理和数据流正在成为大规模应用程序的新瓶颈。同样,基于云的分布式系统还带来了其他挑战,这些挑战源于云应用程序的动态特性。本文通过设计和实现零跳分布式NoSQL存储系统(ZHT)来解决针对超级计算机和云的极端规模存储系统的几个挑战,该系统已针对高端计算系统的要求进行了调整。 ZHT旨在成为可扩展分布式系统的构建块。 ZHT的目标是在极端规模(数百万个节点)上提供高可用性,良好的容错性,轻量级设计,持久性,动态联接和离开,高吞吐量和低延迟。我们已经评估了ZHT在各种系统上的性能,这些系统从具有64个节点的Linux集群,到96个节点的Amazon EC2虚拟集群到具有8K节点的IBM Blue Gene / P超级计算机。该工作还介绍了采用ZHT以及其他NoSQL系统的几个实际系统,即ZHT / Q,FusionFS,IStore,MATRIX,Slurm ++,Fabriq,FREIDA-State和WaggleDB,所有这些实际系统均已得到了显着简化,到NoSQL存储系统,并且在某些情况下已经证明比其他领先系统好几个数量级。通过我们的工作,我们展示了NoSQL存储系统如何在如此多种环境中大规模地提高性能和可伸缩性。

著录项

  • 作者

    Li, Tonglin.;

  • 作者单位

    Illinois Institute of Technology.;

  • 授予单位 Illinois Institute of Technology.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 140 p.
  • 总页数 140
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号