Performance Modeling for Spark Using SVM

机译：使用SVM进行Spark性能建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

At present, Spark is widely used in a number of enterprises. Although Spark is much faster than Hadoop for some applications, its configuration parameters can have a great impact on its performance due to the large number of the parameters, interaction between them, and various characteristics of applications as well. Unfortunately, there is not yet any research conducted to predict the performance of Spark based on its configuration sets.In this paper, we employ a machine learning method-Support Vector Machine(SVM) to build performance models for Spark. The input of configuration sets is collected by running Spark application previously with randomly modified and combined parameter values. In this way, we also determine the range of each property and gain a deeper understanding about how these properties work in Spark. We also use Artificial Neural Network to model the performance of Spark and find that the error rate of ANN is on average 1.98 times that of SVM for three workloads from HiBench.

机译：目前，Spark已在许多企业中广泛使用。尽管对于某些应用程序，Spark比Hadoop快得多，但是由于大量的参数，它们之间的交互以及应用程序的各种特性，Spark的配置参数可能对其性能产生很大的影响。遗憾的是，目前尚未进行任何基于其配置集来预测Spark性能的研究。本文采用一种机器学习方法-支持向量机（SVM）来构建Spark性能模型。通过先前运行带有随机修改和组合参数值的Spark应用程序来收集配置集的输入。通过这种方式，我们还可以确定每个属性的范围，并对这些属性在Spark中的工作方式有更深入的了解。我们还使用人工神经网络对Spark的性能进行建模，发现对于HiBench的三种工作负载，ANN的错误率平均为SVM的1.98倍。

著录项

来源
《International Conference on Cloud Computing and Big Data》|2016年|127-131|共5页
会议地点
作者
Ni Luo; Zhibin Yu; Zhendong Bei; Chengzhong Xu; Chuntao Jiang; Lingfeng Lin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Sparks; Support vector machines; Data models; Benchmark testing; Error analysis; Memory management; Training;

机译：Sparks;支持向量机;数据模型;基准测试;错误分析;内存管理;培训;

相似文献

外文文献
中文文献
专利

1. A Numerical Simulation Study on Improving the Thermal Efficiency of a Spark Ignited Engine - Part 1: Modeling of a Spark Ignited Engine Combustion to Predict Engine Performance Considering Flame Propagation, Knock, and Combustion Chamber Wall [J] . Akira Kikusato, Kusaka Jin, Yasuhiro Daisho SAE International Journal of Engines . 2014,第1期

机译：改善火花点火式发动机热效率的数值模拟研究-第1部分：考虑火焰传播，爆震和燃烧室壁的火花点火式发动机燃烧建模以预测发动机性能
2. Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models [J] . Tanabe K., Kurita T., Nishida K., SAR and QSAR in Environmental Research . 2013,第7a9期

机译：SVM模型变量选择中基于敏感性分析的致癌性预测性能的提高
3. Performance analysis of sentiments in Twitter dataset using SVM models [J] . Lakshmana Kumar Ramasamy, Seifedine Kadry, Yunyoung Nam, International Journal of Electrical and Computer Engineering . 2021,第3期

机译：使用SVM模型的Twitter DataSet情绪分析
4. Performance Modeling for Spark Using SVM [C] . Ni Luo, Zhibin Yu, Zhendong Bei, International Conference on Cloud Computing and Big Data . 2016

机译：使用SVM的Spark性能建模
5. Simplified Sizing and Performance Models for Four-Stroke Spark-Ignition Engines for use in Electric Vehicle Range Extenders. [D] . Brubaker, Jason Tyler. 2014

机译：用于电动汽车增程器的四冲程火花点火发动机的简化尺寸和性能模型。
6. Constructing and Validating High-Performance MIEC-SVM Models in Virtual Screening for Kinases: A Better Way for Actives Discovery [O] . Huiyong Sun, Peichen Pan, Sheng Tian, -1

机译：在激酶的虚拟筛选中构建和验证高性能MIEC-SVM模型：发现活性物的更好方法
7. Table 6: Comparison of the performances of our model (OUR-SVM) with Tregellas et al.’s (2018) two best models based on linear regression (T-LR) and support vector machines (T-SVM). [O] . -1

机译：表6：使用Tregellas等人的模型（我们-SVM）的表演比较。（2018）基于线性回归（T-LR）的两个最佳模型，并支持向量机（T-SVM）。

Performance Modeling for Spark Using SVM

摘要

著录项

相似文献

相关主题

期刊订阅