ACM Transactions on Autonomous and Adaptive Systems

Improving Data-Analytics Performance Via Autonomic Control of Concurrency and Resource Units

Abstract

Many big-data processing jobs use data-analytics frameworks such as Apache Hadoop (currently also known as YARN). Such frameworks have tunable configuration parameters set by experienced system administrators and/or job developers. However, tuning parameters manually can be hard and time-consuming because it requires domain-specific knowledge and an understanding of complex inter-dependencies among parameters. Most frameworks seek efficient resource management by assigning resource units to jobs, with the maximum number of units allowed in a system being part of the system's static configuration. This static resource management has limited effectiveness in coping with job diversity and workload dynamics, even in the case of a single job. The work reported in this article seeks to improve performance (e.g., multi-job makespan and job completion time) without modifying either the framework or the applications, while avoiding the problems of previous self-tuning approaches based on performance models or resource usage. These problems include (1) the need for time-consuming training, typically offline, and (2) unsuitability for multi-job/multi-tenant environments. This article proposes a hierarchical self-tuning approach using (1) a fuzzy-logic controller to dynamically adjust the maximum number of concurrent jobs and (2) additional controllers (one per cluster node) to adjust the maximum number of resource units assigned to jobs on each node. The fuzzy-logic controller uses fuzzy rules based on a concave-downward relationship between aggregate CPU usage and the number of concurrent jobs. The other controllers use a heuristic algorithm to adjust the number of resource units on the basis of both CPU and disk I/O usage by jobs. To manage the maximum number of available resource units in each node, the controllers also take resource usage by other processes (e.g., system processes) into account.
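The two-level control loop described above might be sketched as follows. This is a minimal illustration only: the class names, membership functions, thresholds, and heuristic rules are assumptions made for the sketch and do not reproduce the authors' actual controllers.

```python
def triangular(x, a, b, c):
    """Triangular fuzzy membership function rising from a, peaking at b, falling to c."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)


class ConcurrencyController:
    """Cluster-level fuzzy controller (illustrative): adjusts the maximum
    number of concurrent jobs from aggregate CPU usage in [0.0, 1.0].

    The idea follows the concave-downward relation in the abstract: add
    concurrency while aggregate CPU usage is still low, back off once it
    saturates.
    """

    def __init__(self, max_jobs=4):
        self.max_jobs = max_jobs

    def step(self, cpu_usage):
        # Fuzzify: degrees to which the cluster is under-, well-, or over-loaded.
        low = triangular(cpu_usage, -0.4, 0.0, 0.7)
        ok = triangular(cpu_usage, 0.5, 0.75, 0.95)
        high = triangular(cpu_usage, 0.85, 1.0, 1.4)
        # Defuzzify as a weighted vote over actions: +1 job, hold, -1 job.
        delta = (low * 1.0 + ok * 0.0 + high * -1.0) / max(low + ok + high, 1e-9)
        self.max_jobs = max(1, self.max_jobs + round(delta))
        return self.max_jobs


class NodeUnitController:
    """Per-node heuristic controller (illustrative): adjusts resource units
    from job CPU and disk-I/O usage, reserving headroom for other processes
    (e.g., system processes), as the abstract describes."""

    def __init__(self, units=8, reserve=0.1):
        self.units = units
        self.reserve = reserve  # fraction of capacity kept free

    def step(self, job_cpu, other_cpu, disk_io):
        budget = 1.0 - self.reserve - other_cpu   # capacity left for jobs
        if max(job_cpu, disk_io) > budget:        # node saturated: shrink
            self.units = max(1, self.units - 1)
        elif max(job_cpu, disk_io) < 0.7 * budget:  # ample headroom: grow
            self.units += 1
        return self.units
```

In this sketch the cluster-level controller and the per-node controllers run independent feedback loops, mirroring the hierarchical structure of the proposed approach.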
A prototype of our approach was implemented for Apache Hadoop on a cluster running at CloudLab. The proposed approach was demonstrated and evaluated with workloads composed of jobs with similar resource-usage patterns, as well as with realistic mixed-pattern workloads synthesized by SWIM, a statistical workload injector for MapReduce. The evaluation shows that the proposed approach yields up to a 48% reduction in job makespan relative to Hadoop's default settings.
