首页> 外文学位 >Data center resource management with temporal dynamic workload.
【24h】

Data center resource management with temporal dynamic workload.

机译:具有临时动态工作负载的数据中心资源管理。

获取原文
获取原文并翻译 | 示例

摘要

The proliferation of Internet services drives the data center expansion in both size and the number. More importantly, the energy consumption (as part of total cost of ownership (TCO)) has become a social concern. When the workload demand is given, the data center operators desire minimizing their TCO. On the other hand, when the workload demand is unknown while the requirements on quality (QoE) of experience of the Internet services are given, the data center operators need to determine appropriate amount of resources and design redirection strategies in presence of multiple data centers to guarantee the QoE.;For the first problem, we present formulations to minimize server energy consumption and server cost under three different data enter scenarios (homogeneous, heterogeneous, hybrid hetero-homogeneous clusters) with dynamic temporal demand. Our studies show that the homogeneous model significantly differs the heterogenous model in computational time. To be able to compute optimal configurations in near real-time for large scale data centers, we propose aggregation by maximum and aggregation by mean modes for different Internet service requirements. In aggregation by maximum mode, the price of reducing computational time is over-provisioning (which causes extra energy consumption). In aggregation by mean mode, the price is the degradation of the timeliness of services. However, they still result in significant cost savings compared to the scenario when all servers are on during the entire duration. We first introduce an intuitive aggregation method: static aggregation. For each mode, dynamic aggregation is introduced to alleviate their individual drawbacks. The dynamic aggregation by maximum results in cost savings up to approximately 18% over the static aggregation by maximum. For three random distributed workload cases, the dynamic aggregation by mean can save up to approximately 50% workload reallocation compared to static aggregation. Dynamic Voltage/Frequency Scaling (DVFS) capacity is further considered in our model. Our numerical results show that adopting DVFS results in significantly reduction of energy consumption.;For the second problem, the data center provides resources via the cloud computing model. We propose a hierarchical modeling approach that can easily combine all components in the data center provisioning environment. Identifying interactions among the components is the key to construct such a model. In providing internet service by cloud computing hosted in data centers, we first construct four sub-models: an outbound bandwidth model, a cloud computing (hosted by data centers) availability model, a latency model and a cloud computing response time model. Then we use a data center redirection strategy graph to glue them together. We also introduce an all-in-one barometer to ease the QoE evaluation. The numeric results show that our model serves as a very useful analytical tool for data center operators to provide appropriate resources as well as design redirection strategies.;In addition, we study the redirection strategies (schemes) in a particular Internet service, agent-based virtual private networks architecture (ABVA). It refers to the environment where a third-party provider runs and administers remote access virtual private network (VPN) service for organizations that do not want to maintain their own in-house VPN servers. We consider the problem of optimally connecting users of an organization to VPN server locations in an ABVA environment so that request denial probability and latency are balanced. A user request needs a certain bandwidth between the user and the VPN server. The VPN server may deny requests when the bandwidth is insufficient (capacity limitation). At the same time, the latency perceived by a user from its current location to a VPN server is an important consideration. We present a number of strategies regarding how VPN servers are to be selected and the number of servers to be tried so that request denial probability is minimized without unduly affecting latency. These strategies are studied on a number of different topologies. For our study, we consider Poisson and non-Poisson arrival of requests under both finite and infinite population models to understand the impact on the entire system. We found that the arrival processes have a significant and consistent impact on the request denial probability and the impact on the latency is dependent on the traffic load in the infinite model. In the finite model, arrival processes have an inconsistent impact to the request denial probability. As to the latency in the finite model, arrivals that have a squared co-efficient of variation less than one is consistently largest, followed by Poisson case, then the case that the squared co-efficient of variation is more than one. Finally, a strength of this work is the comparison of infinite and finite models; we found that a mismatch between the infinite and the finite model is dependent both on the number of users in the system and the load.
机译:Internet服务的激增推动了数据中心规模和数量的扩展。更重要的是,能源消耗(作为总拥有成本(TCO)的一部分)已成为社会关注的问题。当给出工作负载需求时,数据中心运营商希望将其总拥有成本降至最低。另一方面,当工作负载需求未知,同时给出了Internet服务的体验质量(QoE)要求时,数据中心运营商需要确定适当的资源量,并在存在多个数据中心的情况下设计重定向策略,以实现以下目标:对于第一个问题,我们提出了在具有动态时间需求的三种不同数据输入场景(同质,异质,混合异质同质集群)下最小化服务器能耗和服务器成本的公式。我们的研究表明,均质模型在计算时间上与异质模型显着不同。为了能够为大型数据中心近乎实时地计算最佳配置,我们提出了针对不同Internet服务需求的最大聚合和平均模式聚合。在以最大模式进行聚合时,减少计算时间的代价是过度配置(这会导致额外的能耗)。在均值聚合模式下,价格是服务及时性的下降。但是,与在整个持续时间内所有服务器都处于打开状态的情况相比,它们仍然可以节省大量成本。我们首先介绍一种直观的聚合方法:静态聚合。对于每种模式,都引入了动态聚合以减轻其各自的缺点。通过最大程度的动态聚合,最多可以比静态聚合节省大约18%的成本。对于三种随机分布的工作负载情况,与静态聚合相比,动态聚合平均可以节省大约50%的工作负载重新分配。我们的模型中进一步考虑了动态电压/频率缩放(DVFS)容量。我们的数值结果表明,采用DVFS可以显着降低能耗。对于第二个问题,数据中心通过云计算模型提供资源。我们提出了一种分层建模方法,该方法可以轻松地组合数据中心供应环境中的所有组件。识别组件之间的交互是构建这种模型的关键。在通过数据中心托管的云计算提供互联网服务时,我们首先构建四个子模型:出站带宽模型,云计算(由数据中心托管)可用性模型,等待时间模型和云计算响应时间模型。然后,我们使用数据中心重定向策略图将它们粘合在一起。我们还推出了多合一气压计,以简化QoE评估。数值结果表明,我们的模型是数据中心运营商提供适当资源以及设计重定向策略的非常有用的分析工具。此外,我们研究了基于代理的特定Internet服务中的重定向策略(方案)虚拟专用网络架构(ABVA)。它指的是第三方提供商为不想维护自己的内部VPN服务器的组织运行和管理远程访问虚拟专用网络(VPN)服务的环境。我们考虑了在ABVA环境中最佳地将组织的用户连接到VPN服务器位置的问题,以便平衡请求拒绝概率和延迟。用户请求在用户和VPN服务器之间需要一定的带宽。当带宽不足(容量限制)时,VPN服务器可能会拒绝请求。同时,用户从其当前位置到VPN服务器的感知延迟是一个重要的考虑因素。我们提出了许多有关如何选择VPN服务器以及要尝试的服务器数量的策略,以使请求拒绝概率最小化,而不会过度影响延迟。这些策略在许多不同的拓扑上进行了研究。对于我们的研究,我们在有限和无限总体模型下考虑泊松和非泊松请求的到达,以了解对整个系统的影响。我们发现,到达过程对请求拒绝概率具有显着且一致的影响,而对延迟的影响则取决于无限模型中的流量负载。在有限模型中,到达过程对请求拒绝概率有不一致的影响。关于有限模型中的等待时间,方差系数小于1的到达始终是最大的,其次是Poisson情况,然后方差系数大于1的情况。最后,这项工作的优势是无限模型与有限模型的比较;我们发现无限模型与有限模型之间的不匹配取决于系统中的用户数量和负载。

著录项

  • 作者

    Qian, Haiyang.;

  • 作者单位

    University of Missouri - Kansas City.;

  • 授予单位 University of Missouri - Kansas City.;
  • 学科 Engineering Computer.;Engineering Electronics and Electrical.;Computer Science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 179 p.
  • 总页数 179
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号