Fair Scheduling in Cloud Datacenters with Multiple Resource Types.

机译：具有多种资源类型的Cloud Datacenter中的公平调度。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This dissertation focuses on algorithm design and prototype implementation of fair sharing policies in cloud datacenters with multiple resource types. Specifically, it seeks to address two fundamental resource management problems.;First, how should the the computing resources of a large-scale cluster ---such as CPU cores, memory, and storage --- be fairly shared among running applications? This problem has become even more complicated in cloud datacenters, mainly due to their unprecedented heterogeneity and complexity. Cloud computing clusters are likely to be constructed from a large number of commodity servers spanning multiple generations, with different computing capabilities, bandwidth, and storage capacities. On the other hand, depending on the underlying applications, computing tasks may require vastly different amounts of resources: some are CPU-intensive; some are memory- or bandwidth-bound. Despite these complexities, existing resource sharing policies either result in significant resource fragmentation, or require a homogeneous cluster where servers are of the same specification.;This dissertation proposes Dominant Resource Fairness in Heterogeneous systems (DRFH), a general multi-resource sharing policy that preserves many axiomatically, and highly desirable "fair" properties that used to be provided by weighted fair sharing in the single-resource setting. DRFH eliminates the application's incentive of cheating the cluster scheduler for more allocation share by misreporting its resource requirement, a strategic behaviour commonly observed in production clusters. Prototype implementation and trace-driven simulations show that DRFH can be easily enforced in cluster systems, with higher resource utilization and shorter job completion time than the existing resource sharing policies.;Second, how should the active flows fairly share the network resources of middleboxes, software routers, and other appliances that are widely deployed in datacenters? In middleboxes or software routers, flows usually undergo deep packet inspection, which requires the support of multiple types of resources, and may bottleneck on either CPU, memory bandwidth, or link bandwidth. While there is rich literature of fair queueing for a single type of resource (i.e., link bandwidth), it remains unclear how to schedule multiple resources in middleboxes to achieve fair sharing among flows. A similar problem also arises in virtual machine (VM) scheduling inside a hypervisor, where different VMs may consume different amounts of resources, and it is desirable to fairly multiplex their access to physical resources.;To answer these challenges, this dissertation proposes Multi-resource Round Robin (MR3) that serves flows in rounds and achieves near-perfect fairness in O(1) time. MR3 serves as a foundation for a more general fair scheduler, called Group Multi-Resource Round Robin (GMR3). GMR3 also runs in O(1) time, yet provides weight-proportional packet latency when flows are assigned uneven weights.;This dissertation also identifies a new challenge that is unique to multi-resource scheduling: the general tradeoff between fairness and efficiency. Such a tradeoff has never been a problem for traditional fair sharing of link bandwidth. As long as the queueing algorithm is work conserving, the bandwidth is always fully utilized, and fairness is the only concern. However, in the presence of multiple resource types, fairness and efficiency are conflicting objectives that cannot be achieved at the same time. Motivated by this problem, a new queueing algorithm is proposed and prototyped. It allows network operator to flexibly specify her tradeoff preference and implements the specified tradeoff by determining the right packet scheduling order.

机译：本文主要研究多种资源类型的云数据中心公平共享策略的算法设计和原型实现。具体来说，它寻求解决两个基本的资源管理问题：首先，应该如何在正在运行的应用程序之间公平地共享大型集群的计算资源（例如CPU核心，内存和存储）？这个问题在云数据中心中变得更加复杂，这主要是由于它们前所未有的异构性和复杂性。云计算集群很可能由跨越多个世代的大量商品服务器构建而成，具有不同的计算功能，带宽和存储容量。另一方面，取决于基础应用程序，计算任务可能需要大量不同的资源：有些资源占用大量CPU；有些资源占用大量CPU资源。有些是受内存或带宽限制的。尽管存在这些复杂性，现有的资源共享策略还是会导致严重的资源碎片化，或者需要服务器规格相同的同质集群。本文提出了异构系统中的优势资源公平性（DRFH），一种通用的多资源共享策略保留了许多公理的，非常可取的“公平”属性，这些属性过去通常是由单一资源设置中的加权公平共享提供的。 DRFH通过错误地报告其资源需求来消除应用程序欺骗集群调度程序以获得更多分配份额的动机，这是生产集群中常见的一种战略行为。原型实现和跟踪驱动的仿真表明，与现有的资源共享策略相比，DRFH可以轻松地在集群系统中实施，具有更高的资源利用率和更短的作业完成时间。其次，活动流应如何公平地共享中间盒的网络资源，软件路由器以及广泛部署在数据中心中的其他设备？在中间盒或软件路由器中，流通常会进行深度数据包检查，这需要支持多种类型的资源，并且可能在CPU，内存带宽或链接带宽上造成瓶颈。虽然有大量文献针对单一类型的资源（即链路带宽）进行公平排队，但仍不清楚如何在中间盒中调度多个资源以实现流之间的公平共享。在虚拟机管理程序内部的虚拟机（VM）调度中也会出现类似的问题，其中不同的VM可能会消耗不同数量的资源，因此希望公平地复用其对物理资源的访问。为解决这些挑战，本论文提出了“ Multi-服务于循环的资源Round Robin（MR3），并在O（1）时间内实现近乎完美的公平性。 MR3是更通用的公平调度程序（称为组多资源循环调度（GMR3））的基础。 GMR3还以O（1）时间运行，但是当为流分配不均匀的权重时，它提供了与权重成比例的数据包延迟。该论文还确定了多资源调度所特有的新挑战：公平与效率之间的一般权衡。对于传统的公平共享链路带宽而言，这种折衷从来就不是问题。只要排队算法是节省工作的，带宽就始终得到充分利用，而公平性是唯一要考虑的问题。但是，在存在多种资源类型的情况下，公平性和效率是相互矛盾的目标，无法同时实现。为此，提出了一种新的排队算法并进行了原型设计。它允许网络运营商灵活地指定其权衡优先级，并通过确定正确的数据包调度顺序来实现指定的权衡。

著录项

作者
Wang, Wei.;
展开▼
作者单位

University of Toronto (Canada).;

展开▼
授予单位 University of Toronto (Canada).;
学科 Computer engineering.;Computer science.
学位 Ph.D.
年度 2015
页码 157 p.
总页数 157
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Dominant Fairness Fairness: Hierarchical Scheduling for Multiple Resources in Heterogeneous Datacenters [J] . Wenzhu WANG, Kun JIANG, Yusong TAN, IEICE transactions on information and systems . 2016,第6期

机译：支配性公平性：异构数据中心中多种资源的分层调度
2. Multiple context based service scheduling for balancing cost and benefits of mobile users and cloud datacenter supplier in mobile cloud [J] . Li Chunlin, Yan Xin, Zhang Yang, Computer networks . 2017,第jula20期

机译：基于多上下文的服务调度，可在移动云中平衡移动用户和云数据中心供应商的成本和收益
3. Job Scheduling with Efficient Resource Monitoring in Cloud Datacenter [J] . ShyamalaLoganathan, SaswatiMukherjee ScientificWorldJournal . 2015,第3期

机译：云数据中心中有效资源监控的作业调度
4. Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds [C] . Haikun Liu, Bingsheng He International Conference for High Performance Computing, Networking, Storage and Analysis . 2014

机译：互惠资源公平：在IaaS云中实现协作式多资源公平共享
5. Machine Learning for Automatic Resource Management in the Datacenter and the Cloud [D] . Yadwadkar, Neeraja Jayant. 2018

机译：用于数据中心和云中自动资源管理的机器学习
6. Job Scheduling with Efficient Resource Monitoring in Cloud Datacenter [O] . Shyamala Loganathan, Saswati Mukherjee 2015

机译：云数据中心中具有有效资源监控的作业调度
7. MRFS: A Multi-resource Fair Scheduling Algorithm in Heterogeneous Cloud Computing [O] . Hamed Hamzeh, Sofia Meacham, Kashaf Khan, 2020

机译：MRFS：异构云计算中的多资源公平调度算法

Fair Scheduling in Cloud Datacenters with Multiple Resource Types.

摘要

著录项

相似文献

相关主题

期刊订阅