首页> 外文学位 >Scalable parallel computing on clouds: Efficient and scalable architectures to perform pleasingly parallel, MapReduce and iterative data intensive computations on cloud environments.
【24h】

Scalable parallel computing on clouds: Efficient and scalable architectures to perform pleasingly parallel, MapReduce and iterative data intensive computations on cloud environments.

机译:云上的可伸缩并行计算:高效且可伸缩的架构,可在云环境上执行令人满意的并行,MapReduce和迭代式数据密集型计算。

获取原文
获取原文并翻译 | 示例

摘要

Over the last decade, three major disruptive trends driven by the software industry altered the scalable parallel computing landscape. These disruptions are the data deluge (i.e., shift to data-intensive from compute-intensive), next generation compute and storage frameworks based on MapReduce, and the utility computing model introduced by cloud computing environments. This thesis focuses on the intersection of these three disruptions and evaluates the feasibility of using cloud computing environments to perform large-scale, data-intensive computations using next-generation programming and execution frameworks. The current key challenges for performing scalable parallel computing in cloud environments include identifying suitable application patterns, identifying efficient and easy-to-use programing abstractions to represent those patterns, performing appropriate task partitioning and task scheduling, identifying suitable data storage and staging architectures, utilizing suitable communication patterns, and identifying appropriate fault tolerance mechanisms.;This thesis will identify three types of application patterns that are well suited for cloud environments. Presented first are pleasingly parallel computations, including pleasingly parallel programming frameworks for cloud environments. Secondly, MapReduce-type applications are explored, including a decentralized architecture and a prototype implementation to develop MapReduce frameworks using cloud infrastructure services. Third and finally, data-intensive iterative applications, which encompass many graph processing algorithms, machine-learning algorithms, and more, are considered. We present the Twister4Azure architecture and runtime as a solution for implementation of data-intensive iterative applications in cloud environments. Twister4Azure architecture extends the familiar, easy-to-use MapReduce programming model with iterative extensions and iterative specific optimizations, enabling a wide array of large-scale iterative and non-iterative data analysis and scientific applications to utilize cloud platforms easily and efficiently in a fault-tolerant manner.;Collective communication operations facilitate the optimized communication and coordination between groups of nodes of distributed computations, which leads to many advantages. We also present the applicability of collective communication operations to the iterative MapReduce computations on cloud and cluster environments, enriching these computations with additional application patterns without sacrificing the desirable properties of the MapReduce model. The addition of collective communication operations enhances the iterative MapReduce model by offering many performance improvements and ease-of-use advantages.
机译:在过去的十年中,软件行业驱动的三大颠覆性趋势改变了可扩展并行计算的格局。这些破坏包括数据泛滥(即从计算密集型转变为数据密集型),基于MapReduce的下一代计算和存储框架以及云计算环境引入的效用计算模型。本文着眼于这三个中断的交叉点,并评估了使用云计算环境通过下一代编程和执行框架执行大规模,数据密集型计算的可行性。在云环境中执行可伸缩并行计算的当前主要挑战包括:确定合适的应用程序模式,确定代表这些模式的高效且易于使用的编程抽象,执行合适的任务分区和任务调度,确定合适的数据存储和分段架构,利用合适的通信模式,并确定合适的容错机制。本文将确定三种非常适合云环境的应用程序模式。首先介绍令人愉悦的并行计算,包括针对云环境的令人愉悦的并行编程框架。其次,探索了MapReduce类型的应用程序,包括分散的体系结构和使用云基础架构服务开发MapReduce框架的原型实现。第三,也是最后,考虑了数据密集型迭代应用程序,其中包括许多图形处理算法,机器学习算法等。我们介绍了Twister4Azure架构和运行时,作为在云环境中实现数据密集型迭代应用程序的解决方案。 Twister4Azure架构通过迭代扩展和迭代特定优化扩展了熟悉的,易于使用的MapReduce编程模型,从而使各种大规模的迭代和非迭代数据分析以及科学应用程序能够在故障中轻松高效地利用云平台。容忍方式。集体通信操作促进了分布式计算的节点组之间的优化通信和协调,这带来了许多优势。我们还介绍了集体通信操作在云和集群环境中对迭代MapReduce计算的适用性,并在不牺牲MapReduce模型的理想特性的情况下,通过其他应用程序模式丰富了这些计算。集体通信操作的添加通过提供许多性能改进和易用性优势,增强了迭代MapReduce模型。

著录项

  • 作者

    Gunarathne, Thilina.;

  • 作者单位

    Indiana University.;

  • 授予单位 Indiana University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 208 p.
  • 总页数 208
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:53:52

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号