首页> 外文会议>IEEE International Conference on Web Services >MURS: Mitigating Memory Pressure in Service-oriented Data Processing Systems
【24h】

MURS: Mitigating Memory Pressure in Service-oriented Data Processing Systems

机译:Murs:减轻面向服务的数据处理系统中的记忆压力

获取原文

摘要

Although a data processing system often works as a batch processing system, many enterprises deploy such a system as a service, which we call the service-oriented data processing system. It has been shown that in-memory data processing systems suffer from serious memory pressure. The situation becomes even worse for the service-oriented data processing systems due to various reasons. For example, in a service-oriented system, multiple submitted tasks are launched at the same time and executed in the same context in the resources, compared with the batch processing mode where the tasks are processed one by one. Therefore, the memory pressure will affect all submitted tasks, including the tasks that only incur the light memory pressure when they are run alone. In this paper, we find that the reason why memory pressure arises is because the running tasks produce massive long-living data objects in the limited memory space. Our studies further reveal that the long-living data objects are generated by the API functions that are invoked by the in-memory processing frameworks. Based on these findings, we propose a method to classify the API functions based on the memory usage rate. Further, we design a scheduler called MURS to mitigate the memory pressure. We implement MURS in Spark and conduct the experiments to evaluate the performance of MURS. The results show that when comparing to Spark, MURS can 1) decrease the execution time of the submitted jobs by up to 65.8%, 2) mitigate the memory pressure in the server by decreasing the garbage collection time by up to 81%, and 3) reduce the data spilling, and hence disk I/O, by approximately 90%.
机译:虽然数据处理系统通常作为批处理系统工作,但许多企业将如此系统部署为服务,我们称之为面向服务的数据处理系统。已经表明,内存数据处理系统遭受严重的记忆压力。由于各种原因,面向服务的数据处理系统的情况变得更糟。例如,在面向服务的系统中,与批处理模式相比,在资源中的相同上下文中同时启动多个提交的任务,并在其中一个接一个地处理任务。因此,记忆压力将影响所有提交的任务,包括仅在单独运行时引起灯记忆压力的任务。在本文中,我们发现内存压力出现的原因是因为运行任务在有限的存储空间中产生大量的长生物数据对象。我们的研究进一步揭示了长生物数据对象由内存处理框架调用的API函数生成。基于这些发现,我们提出了一种基于内存使用率对API函数进行分类的方法。此外,我们设计一个名为Murs的调度程序来缓解内存压力。我们在火花中实施Murs,并进行实验以评估Murs的表现。结果表明,当与火花相比,Murs Can 1)将提交工作的执行时间减少到65.8%,2)通过将垃圾收集时间减少到81%,3时,减轻服务器中的内存压力)减少数据溢出,因此磁盘I / O,大约90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号