MURS: Mitigating Memory Pressure in Service-oriented Data Processing Systems

机译：Murs：减轻面向服务的数据处理系统中的记忆压力

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although a data processing system often works as a batch processing system, many enterprises deploy such a system as a service, which we call the service-oriented data processing system. It has been shown that in-memory data processing systems suffer from serious memory pressure. The situation becomes even worse for the service-oriented data processing systems due to various reasons. For example, in a service-oriented system, multiple submitted tasks are launched at the same time and executed in the same context in the resources, compared with the batch processing mode where the tasks are processed one by one. Therefore, the memory pressure will affect all submitted tasks, including the tasks that only incur the light memory pressure when they are run alone. In this paper, we find that the reason why memory pressure arises is because the running tasks produce massive long-living data objects in the limited memory space. Our studies further reveal that the long-living data objects are generated by the API functions that are invoked by the in-memory processing frameworks. Based on these findings, we propose a method to classify the API functions based on the memory usage rate. Further, we design a scheduler called MURS to mitigate the memory pressure. We implement MURS in Spark and conduct the experiments to evaluate the performance of MURS. The results show that when comparing to Spark, MURS can 1) decrease the execution time of the submitted jobs by up to 65.8%, 2) mitigate the memory pressure in the server by decreasing the garbage collection time by up to 81%, and 3) reduce the data spilling, and hence disk I/O, by approximately 90%.

机译：虽然数据处理系统通常作为批处理系统工作，但许多企业将如此系统部署为服务，我们称之为面向服务的数据处理系统。已经表明，内存数据处理系统遭受严重的记忆压力。由于各种原因，面向服务的数据处理系统的情况变得更糟。例如，在面向服务的系统中，与批处理模式相比，在资源中的相同上下文中同时启动多个提交的任务，并在其中一个接一个地处理任务。因此，记忆压力将影响所有提交的任务，包括仅在单独运行时引起灯记忆压力的任务。在本文中，我们发现内存压力出现的原因是因为运行任务在有限的存储空间中产生大量的长生物数据对象。我们的研究进一步揭示了长生物数据对象由内存处理框架调用的API函数生成。基于这些发现，我们提出了一种基于内存使用率对API函数进行分类的方法。此外，我们设计一个名为Murs的调度程序来缓解内存压力。我们在火花中实施Murs，并进行实验以评估Murs的表现。结果表明，当与火花相比，Murs Can 1）将提交工作的执行时间减少到65.8％，2）通过将垃圾收集时间减少到81％，3时，减轻服务器中的内存压力）减少数据溢出，因此磁盘I / O，大约90％。

著录项

来源
《IEEE International Conference on Web Services》|2017年|923p|共8页
会议地点
作者
Xuanhua Shi; Xiong Zhang; Ligang He; Hai Jin; Zhixiang Ke; Song Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词
Memory pressure; Memory usage rate; Task scheduler; Data processing systems; Service-oriented;

机译：记忆力;内存使用率;任务调度程序;数据处理系统;面向服务;

相似文献

外文文献
中文文献
专利

1. Service-oriented architecture of adaptive, intelligent data acquisition and processing systems for long-pulse fusion experiments [J] . J. Gonzalez, rnM. Ruiz, rnE. Barrera, Fusion Engineering and Design . 2010,第3a4期

机译：面向服务的自适应，智能数据采集和处理系统的体系结构，用于长脉冲融合实验
2. Memory-side prefetching for linked data structures for processor-in-memory systems [J] . Hughes CJ, Adve SV Journal of Parallel and Distributed Computing . 2005,第4期

机译：内存侧预取，用于内存中处理器系统的链接数据结构
3. Data Prefetching and Eviction Mechanisms of In-Memory Storage Systems Based on Scheduling for Big Data Processing [J] . Chen Chien-Hung, Hsia Ting-Yuan, Huang Yennun, IEEE Transactions on Parallel and Distributed Systems . 2019,第8期

机译：基于调度的大数据处理内存存储系统数据预取与收回机制
4. MURS: Mitigating Memory Pressure in Service-oriented Data Processing Systems [C] . Xuanhua Shi, Xiong Zhang, Ligang He, IEEE International Conference on Web Services . 2017

机译：Murs：减轻面向服务的数据处理系统中的记忆压力
5. Streamlining Big Data Processing Pipelines via Unix Memory Tools, Persistent Spark Datasets, and the Apache Ignite Inmemory File System [D] . Blair, Walter 2018

机译：通过Unix内存工具，持久性Spark数据集和Apache Ignite内存文件系统简化大数据处理管道
6. Joint Optimization of Process Flow and Scheduling in Service-Oriented Manufacturing Systems [O] . Joe Vargas, Roque Calvo 2018

机译：面向服务的制造系统中流程和调度的联合优化
7. Service-oriented architecture of adaptive, intelligent data acquisition and processing systems for long-pulse fusion experiments [O] . Gonzalez Joaquin, Ruiz González Mariano, Barrera Lopez de Turiso Eduardo, 2010

机译：面向服务的自适应，智能数据采集和处理系统的体系结构，用于长脉冲融合实验

MURS: Mitigating Memory Pressure in Service-oriented Data Processing Systems

摘要

著录项

相似文献

相关主题

期刊订阅