Analyzing the distribution fit for storage workload and Internet traffic traces

Wajahat Muhammad; Yele Aditya; Estro Tyler; Gandhi Anshul; Zadok Erez

首页> 外文期刊>Performance Evaluation >Analyzing the distribution fit for storage workload and Internet traffic traces

【24h】

Analyzing the distribution fit for storage workload and Internet traffic traces

机译：分析储存工作量和互联网流量迹线的分配

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Understanding workloads and modeling their performance is important for optimizing systems and services. A useful first step towards understanding the characteristics of workloads is to analyze their inter-arrival times and service requirements. If these characteristics are found to follow certain probability distributions, then corresponding stochastic models can be employed to efficiently estimate the performance of workloads. Such approaches have been explored in specific domains using an assortment of distribu-tions, including the Normal, Weibull, and Exponential. Our primary goal in this work is to understand and model storage workload performance. However, our analysis and others & rsquo; past attempts revealed that none of the commonly-employed distributions provided a good fit for storage workloads. We analyzed over 250 traces across 5 different workload families using 20 widely used distributions, including ones seldom used for storage modeling. We found that the Hyper-exponential distribution with just two phases (H-2) was superior in modeling the storage traces compared to other distributions under five diverse metrics of accuracy, including metrics that assess the risk of over-fitting. Based on these results, we developed a Markov-chain-based stochastic model that accurately estimates the storage system performance across several workload traces. To assess the applicability of the Hyper-exponential for distribution fitting beyond storage traces, we evaluated distribution fitting for Internet traffic traces using over 1,600 traces from 3 different sources. We again found that the Hyper-exponential distribution provided a superior fit compared to other probability distributions. To highlight the applicability of our model, we conducted what-if analyses to investigate (i) the storage performance impact of workload variability and garbage collection under various scenarios and (ii) the impact on service response time of Internet flash crowds. (C) 2020 Elsevier B.V. All rights reserved.

机译：了解工作负载和建模性能对于优化系统和服务非常重要。了解工作负载特征的一个有用的第一步是分析他们的到达间时间和服务要求。如果发现这些特性遵循某些概率分布，则可以采用相应的随机模型来有效地估计工作负载的性能。使用各种分配器，包括正常，Weibull和指数，在特定域中已经探讨了这种方法。我们在这项工作中的主要目标是了解和模拟存储工作负载性能。但是，我们的分析和其他＆rsquo;过去的尝试透露，普通实用的分布都没有提供良好的存储工作负载。我们使用20个广泛使用的分布分析了超过5种不同的工作负载系列的250个迹线，包括很少用于存储建模的人。我们发现，只有两个阶段（H-2）的超级指数分布在模拟存储迹线与其他五种不同精度的不同度量标准的分布相比，包括评估过度拟合风险的指标。基于这些结果，我们开发了一种基于马尔可夫链的随机模型，可准确估计多个工作负载迹线的存储系统性能。为了评估超级指数用于分配拟合超出存储迹线的应用程序，我们评估了来自3个不同源的超过1,600个迹线的互联网流量迹线的分布拟合。我们再次发现，与其他概率分布相比，超指数分布提供了优异的拟合。为了突出我们模型的适用性，我们进行了在各种情况下调查（i）工作负载变异性和垃圾收集的存储性能影响的内容 - （ii）对互联网闪存人群的服务响应时间的影响。（c）2020 Elsevier B.v.保留所有权利。

著录项

来源
《Performance Evaluation》 |2020年第9期|102121.1-102121.24|共24页
作者
Wajahat Muhammad; Yele Aditya; Estro Tyler; Gandhi Anshul; Zadok Erez;
展开▼
作者单位

SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA;

SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA;

SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA;

SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA;

SUNY Stony Brook Dept Comp Sci Stony Brook NY 11794 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Distribution fitting; Storage traces; Hyper-exponential; Performance modeling;

机译：分配拟合;存储迹线;超级指数;性能建模;

相似文献

外文文献
中文文献
专利

1. FITTING TRAFFIC TRACES WITH DISCRETE CANONICAL PHASE TYPE DISTRIBUTIONS AND MARKOV ARRIVAL PROCESSES [J] . Andras MESZAROS, Janos PAPP, Miklos TELEK International Journal of Applied Mathematics and Computer Science . 2014,第3期

机译：具有离散典型相类型分布和马尔可夫到达过程的交通轨迹
2. Fitting traffic traces with discrete canonical phase type distributions and Markov arrival processes [J] . András Meszáros, János Papp, Miklós Telek International journal of applied mathematics and computer science . 2014,第3期

机译：用离散的规范相位类型分布和马尔可夫到达过程拟合交通轨迹
3. Framework for Analyzing Android I/O Stack Behavior: From Generating the Workload to Analyzing the Trace* [J] . Jungwoo Hwang, Kisung Lee, Seongjin Lee, Future Internet . 2013,第4期

机译：分析Android I / O堆栈行为的框架：从生成工作量到分析跟踪*
4. Distribution Fitting and Performance Modeling for Storage Traces [C] . Muhammad Wajahat, Aditya Yele, Tyler Estro, IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems . 2019

机译：存储跟踪的分布拟合和性能建模
5. Statistical Characterization of Storage System Workloads for Data Deduplication and Load Placement in Heterogeneous Storage Environments. [D] . Park, Nohhyun. 2013

机译：异构存储环境中用于重复数据删除和负载放置的存储系统工作负载的统计特性。
6. An efficient method to detect periodic behavior in botnet traffic by analyzing control plane traffic [O] . Basil AsSadhan, José M.F. Moura 2014

机译：通过分析控制平面流量来检测僵尸网络流量中周期性行为的有效方法
7. Analyzing the Potential Benefits of CDN Augmentation Strategies for Internet Video Workloads [O] . Athula Balachandran, Vyas Sekar, Aditya Akella, 2013

机译：分析互联网视频工作量的CDN增强策略的潜在好处

Analyzing the distribution fit for storage workload and Internet traffic traces

摘要

著录项

相似文献

相关主题

期刊订阅