Performance enhancement for iterative data computing with in-memory concurrent processing

Abstract

The big data era has driven the development of several data analysis tools. Spark is an in-memory processing tool suited to iterative and interactive data mining. It delivers higher data-processing performance than MapReduce, which is an offline storage mechanism. However, some disadvantages of in-memory processing, such as massive in-memory data requirements, cause cross-node data transfers that result in long computation times. Performance can be improved if the in-memory process is executed with fewer shuffle instructions. Therefore, this study aims to enhance the performance of iterative applications through instruction replacement. Three empirical research cases with diverse datasets and iterations are used to modify the programs. We adopt a strategy of downloading a small resilient distributed dataset (RDD) and replacing the shuffle-included instructions to shorten the processing time, with the code replacement automated by exhaustive code matching. The experimental results reveal an improvement of up to 39% in execution time compared with the existing in-memory processing programs across various dataset sizes.
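The abstract does not list the actual replacement rules, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It models, in plain Python without Spark, why replacing a shuffle-based join with a local join against a small dataset copied to every partition can reduce cross-node record movement. All names here (`shuffle_join`, `broadcast_join`, the partition layout, integer keys) are assumptions made for the illustration.

```python
from collections import defaultdict


def _repartition(parts, num_parts):
    """Move every (key, value) record to partition key % num_parts.

    Returns the new partitions and the number of records that changed
    partition, which stands in for shuffle traffic in this toy model.
    """
    buckets = [[] for _ in range(num_parts)]
    moved = 0
    for src, part in enumerate(parts):
        for key, value in part:
            dst = key % num_parts  # integer keys keep the sketch deterministic
            moved += int(dst != src)
            buckets[dst].append((key, value))
    return buckets, moved


def _local_join(large_part, small_records):
    """Inner-join one partition of the large dataset against small records."""
    lookup = defaultdict(list)
    for key, value in small_records:
        lookup[key].append(value)
    return [(k, (v, w)) for k, v in large_part for w in lookup.get(k, [])]


def shuffle_join(large_parts, small_parts):
    """Join by repartitioning BOTH datasets on the key (models a shuffle)."""
    n = len(large_parts)
    big, moved_big = _repartition(large_parts, n)
    small, moved_small = _repartition(small_parts, n)
    rows = [row for i in range(n) for row in _local_join(big[i], small[i])]
    return rows, moved_big + moved_small


def broadcast_join(large_parts, small_parts):
    """Join by copying the whole small dataset to every partition.

    The large dataset never moves; only the small one is transferred,
    once per partition.
    """
    small_all = [rec for part in small_parts for rec in part]
    moved = len(small_all) * len(large_parts)
    rows = [row for part in large_parts for row in _local_join(part, small_all)]
    return rows, moved


if __name__ == "__main__":
    large = [[(1, "a"), (2, "b"), (3, "c")], [(1, "d"), (4, "e"), (2, "f")]]
    small = [[(1, "x")], [(2, "y")]]
    rows_s, moved_s = shuffle_join(large, small)
    rows_b, moved_b = broadcast_join(large, small)
    print(sorted(rows_s) == sorted(rows_b), moved_s, moved_b)
```

Both functions return the same joined rows; they differ only in how many records are counted as moved. In this model the broadcast variant pays off when the small dataset is much smaller than the large one, which is consistent with the abstract's emphasis on a small dataset.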

Bibliographic record

Source
Concurrency, practice and experience | 2020, Issue 7 | e5593.1-e5593.16 | 16 pages
Author affiliations
National Taipei University, Graduate Institute of Information Management, New Taipei, Taiwan;

Academia Sinica, Institute of Information Science, Taipei, Taiwan;

Original format: PDF
Language: English
Keywords
iterative application; massive data; performance evaluation; program analysis; Spark;


