...
首页> 外文期刊>Computers, Materials & Continua >A Dynamic Memory Allocation Optimization Mechanism Based on Spark
【24h】

A Dynamic Memory Allocation Optimization Mechanism Based on Spark

机译:基于火花的动态内存分配优化机制

获取原文
获取原文并翻译 | 示例

摘要

Spark is a distributed data processing framework based on memory. Memory allocation is a focus question of Spark research. A good memory allocation scheme can effectively improve the efficiency of task execution and memory resource utilization of the Spark. Aiming at the memory allocation problem in the Spark2.x version, this paper optimizes the memory allocation strategy by analyzing the Spark memory model, the existing cache replacement algorithms and the memory allocation methods, which is on the basis of minimizing the storage area and allocating the execution area according to the demand. It mainly including two parts: cache replacement optimization and memory allocation optimization. Firstly, in the storage area, the cache replacement algorithm is optimized according to the characteristics of RDD Partition, which is combined with PC A dimension. In this section, the four features of RDD Partition are selected. When the RDD cache is replaced, only two most important features are selected by PCA dimension reduction method each time, thereby ensuring the generalization of the cache replacement strategy. Secondly, the memory allocation strategy of the execution area is optimized according to the memory requirement of Task and the memory space of storage area. In this paper, a series of experiments in Spark on Yarn mode are carried out to verify the effectiveness of the optimization algorithm and improve the cluster performance.
机译:Spark是基于内存的分布式数据处理框架。内存分配是Spark Research的焦点问题。良好的内存分配方案可以有效地提高火花的任务执行和存储器资源利用的效率。针对Spark2.x版本中的内存分配问题,本文通过分析Spark Memory模型,现有的高速缓存替换算法和内存分配方法来优化内存分配策略,这是基于最小化存储区域和分配执行区域根据需求。它主要包括两部分:缓存替换优化和内存分配优化。首先,在存储区域中,根据RDD分区的特性优化高速缓存替换算法,其与PC维度组合。在本节中,选择了RDD分区的四个功能。当替换RDD高速缓存时,每次仅通过PCA尺寸减少方法选择两个最重要的特征,从而确保高速缓存替换策略的泛化。其次,根据任务的内存要求和存储区域的存储空间进行优化执行区域的存储器分配策略。在本文中,进行了一系列在纱线模式下的火花实验,以验证优化算法的有效性,提高集群性能。

著录项

  • 来源
    《Computers, Materials & Continua 》 |2019年第2期| 739-757| 共19页
  • 作者单位

    College of Information Technology Hebei University of Economics and Business Shijiazhuang 050061 China;

    College of Information Technology Hebei University of Economics and Business Shijiazhuang 050061 China;

    College of Information Technology Hebei University of Economics and Business Shijiazhuang 050061 China;

    College of Mathematics and Computer Science Xinyu University Xinyu 338004 China;

    College of Mathematics and Computer Science Xinyu University Xinyu 338004 China;

    College of Mathematics and Computer Science Xinyu University Xinyu 338004 China;

    College of Mathematics and Computer Science Xinyu University Xinyu 338004 China;

    School of Information Engineering Jiangsu Polytechnic College of Agriculture and Forestry Jurong 212400 China;

    School of Computer Science University College Dublin Dublin 4 Ireland;

    College of Information Engineering Sanming University Sanming 365004 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Memory calculation; memory allocation optimization; cache replacement optimization;

    机译:记忆计算;内存分配优化;缓存替换优化;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号