【24h】

Analysis and Clustering of Workload in Google Cluster Trace Based on Resource Usage

机译:基于资源使用情况的Google集群跟踪中的工作负载分析和集群

获取原文
获取原文并翻译 | 示例

摘要

Cloud computing has gained interest amongst commercial organizations, research communities, developers and other individuals during the past few years. In order to move ahead with research in field of data management and to enable processing of such data, we need benchmark datasets and freely available data which are publicly accessible. Google in May 2011 released a trace of a cluster of 11k machines referred as "Google Cluster Trace". This trace contains cell information of about 29 days. This paper provides analysis of resource usage and requirements in this trace and is an attempt to give an insight about such kind of production trace similar to the ones in cloud environment. The major contributions of this paper include statistical profile of jobs based on resource usage, clustering of workload patterns and classification of jobs into different types based on k-means clustering. Though there have been earlier works for analysis of this trace, but our analysis provides several new findings such as jobs in a production trace are trimodal and there occurs symmetry in the tasks within a long job type.
机译:在过去的几年中,云计算已引起商业组织,研究社区,开发人员和其他个人的兴趣。为了推进数据管理领域的研究并实现此类数据的处理,我们需要基准数据集和可公开访问的免费数据。 Google在2011年5月发布了11k机器集群的踪迹,称为“ Google Cluster Trace”。此跟踪包含大约29天的单元格信息。本文提供了此跟踪中资源使用情况和需求的分析,并试图提供一种类似于云环境中的生产跟踪的见解。本文的主要贡献包括基于资源使用情况的作业统计资料,工作负载模式的聚类以及基于k均值聚类的不同类型的作业分类。尽管已经有较早的工作来分析这种痕迹,但是我们的分析提供了一些新的发现,例如生产痕迹中的作业是三峰的,并且在长作业类型中的任务中存在对称性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号