首页> 外文学位 >Scheduling in MapReduce Clusters
【24h】

Scheduling in MapReduce Clusters

机译:MapReduce群集中的调度

获取原文
获取原文并翻译 | 示例

摘要

MapReduce is a framework proposed by Google for processing huge amounts of data in a distributed environment. The simplicity of the programming model and the fault-tolerance feature of the framework make it very popular in Big Data processing.;As MapReduce clusters get popular, their scheduling becomes increasingly important. On one hand, many MapReduce applications have high performance requirements, for example, on response time and/or throughput. On the other hand, with the increasing size of MapReduce clusters, the energy-efficient scheduling of MapReduce clusters becomes inevitable. These scheduling challenges, however, have not been systematically studied.;The objective of this dissertation is to provide MapReduce applications with low cost and energy consumption through the development of scheduling theory and algorithms, energy models, and energy-aware resource management. In particular, we will investigate energy-efficient scheduling in hybrid CPU-GPU MapReduce clusters. This research work is expected to have a breakthrough in Big Data processing, particularly in providing green computing to Big Data applications such as social network analysis, medical care data mining, and financial fraud detection. The tools we propose to develop are expected to increase utilization and reduce energy consumption for MapReduce clusters. In this PhD dissertation, we propose to address the aforementioned challenges by investigating and developing 1) a match-making scheduling algorithm for improving the data locality of Map- Reduce applications, 2) a real-time scheduling algorithm for heterogeneous Map- Reduce clusters, and 3) an energy-efficient scheduler for hybrid CPU-GPU Map- Reduce cluster.
机译:MapReduce是Google提出的用于在分布式环境中处理大量数据的框架。编程模型的简单性和框架的容错特性使其在大数据处理中非常受欢迎。随着MapReduce集群的流行,其调度变得越来越重要。一方面,许多MapReduce应用程序对响应时间和/或吞吐量都有很高的性能要求。另一方面,随着MapReduce集群规模的增加,MapReduce集群的节能调度变得不可避免。然而,尚未对这些调度挑战进行系统的研究。本论文的目的是通过调度理论和算法,能源模型和能源感知资源管理的发展,为MapReduce应用提供低成本和低能耗的解决方案。特别地,我们将研究混合CPU-GPU MapReduce集群中的节能调度。预计这项研究工作将在大数据处理方面取得突破,特别是在为大数据应用程序提供绿色计算方面,例如社交网络分析,医疗数据挖掘和财务欺诈检测。我们建议开发的工具有望提高MapReduce集群的利用率并减少能耗。在本博士论文中,我们建议通过研究和开发以下方面来解决上述挑战:1)一种用于提高Map-Reduce应用程序数据局部性的匹配调度算法,2)一种针对异构Map-Reduce集群的实时调度算法,和3)混合CPU-GPU Map-Reduce集群的节能调度程序。

著录项

  • 作者

    He, Chen.;

  • 作者单位

    The University of Nebraska - Lincoln.;

  • 授予单位 The University of Nebraska - Lincoln.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2018
  • 页码 124 p.
  • 总页数 124
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号