Synthesizing benchmarks for predictive modeling

机译：综合基准以进行预测建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Predictive modeling using machine learning is an effective method for building compiler heuristics, but there is a shortage of benchmarks. Typical machine learning experiments outside of the compilation field train over thousands or millions of examples. In machine learning for compilers, however, there are typically only a few dozen common benchmarks available. This limits the quality of learned models, as they have very sparse training data for what are often high-dimensional feature spaces. What is needed is a way to generate an unbounded number of training programs that finely cover the feature space. At the same time the generated programs must be similar to the types of programs that human developers actually write, otherwise the learning will target the wrong parts of the feature space. We mine open source repositories for program fragments and apply deep learning techniques to automatically construct models for how humans write programs. We sample these models to generate an unbounded number of runnable training programs. The quality of the programs is such that even human developers struggle to distinguish our generated programs from hand-written code. We use our generator for OpenCL programs, CLgen, to automatically synthesize thousands of programs and show that learning over these improves the performance of a state of the art predictive model by 1.27x. In addition, the fine covering of the feature space automatically exposes weaknesses in the feature design which are invisible with the sparse training examples from existing benchmark suites. Correcting these weaknesses further increases performance by 4.30x.

机译：使用机器学习进行预测建模是构建编译器试探法的有效方法，但是缺乏基准。编译领域之外的典型机器学习实验训练了成千上万个示例。但是，在针对编译器的机器学习中，通常只有几十个通用基准可用。这限制了学习模型的质量，因为它们对于通常是高维特征空间的训练数据非常稀疏。所需要的是一种生成无数训练程序的方法，这些训练程序可以很好地覆盖特征空间。同时，生成的程序必须类似于人类开发人员实际编写的程序类型，否则学习将针对特征空间的错误部分。我们挖掘用于程序片段的开源资源库，并应用深度学习技术来自动构建用于人类编写程序的模型。我们对这些模型进行采样以生成无数可运行的训练程序。程序的质量是如此之高，以至于人类开发人员都难以将我们生成的程序与手写代码区分开。我们使用OpenCL程序生成器CLgen来自动合成数千个程序，并表明对这些程序的学习可以将最新的预测模型的性能提高1.27倍。此外，对特征空间的精细覆盖自动暴露了特征设计中的弱点，而这些弱点在现有基准套件的稀疏训练示例中是看不到的。纠正这些弱点后，性能进一步提高了4.30倍。

著录项

来源
《IEEE/ACM International Symposium on Code Generation and Optimization》|2017年|86-99|共14页
会议地点
作者
Chris Cummins; Pavlos Petoumenos; Zheng Wang; Hugh Leather;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Benchmark testing; Predictive models; Training; Machine learning; Grammar; Semantics; Data models;

机译：基准测试;预测模型;培训;机器学习;语法;语义;数据模型;

相似文献

外文文献
中文文献
专利

1. Benchmarking QSP Models Against Simple Models: A Path to Improved Comprehension and Predictive Performance [J] . Andrew M. Stein, Michael Looby CPT: Pharmacometrics & Systems Pharmacology . 2018,第8期

机译：对照简单模型对QSP模型进行基准测试：提高理解力和预测性能的途径
2. Three-dimensional modeling of laser-plasma interaction: Benchmarking our predictive modeling tools versus experiments [J] . Divol L, Berger RL, Meezan NB, Physics of plasmas . 2008,第5aPta2期

机译：激光-等离子体相互作用的三维建模：将我们的预测建模工具与实验进行基准比较
3. Three-dimensional modeling of laser-plasma interaction: Benchmarking our predictive modeling tools versus experiments [J] . L. Divol, R. L. Berger, N. B. Meezan, Physics of plasmas . 2008,第5aPta2期

机译：激光-等离子体相互作用的三维建模：将我们的预测建模工具与实验进行基准比较
4. Synthesizing benchmarks for predictive modeling [C] . Chris Cummins, Pavlos Petoumenos, Zheng Wang, IEEE/ACM International Symposium on Code Generation and Optimization . 2017

机译：合成预测建模基准测试
5. Experimental Benchmark of Computational Fluid Dynamics Models to Predict Used Nuclear Fuel Cladding Temperatures during Vacuum Drying Conditions [D] . Maharjan, Dilesh. 2018

机译：计算流体动力学模型预测真空干燥条件下用过的核燃料熔覆温度的实验基准
6. Benchmarking QSP Models Against Simple Models: A Path to Improved Comprehension and Predictive Performance [O] . Andrew M. Stein, Michael Looby 2018

机译：对照简单模型对QSP模型进行基准测试：提高理解力和预测性能的途径
7. Synthesizing benchmarks for predictive modeling [O] . Cummins Chris, Petoumenos Pavlos, Wang Zheng, 2017

机译：综合基准以进行预测建模
8. Three Dimensional Modeling of Laser-Plasma Interaction: Benchmarking our Predictive Modeling Tools vs. Experiments [R] . Divol, L., Berger, R., Meezan, N., 2007

机译：激光 - 等离子体相互作用的三维建模：基于我们的预测建模工具与实验的基准

Synthesizing benchmarks for predictive modeling

摘要

著录项

相似文献

相关主题

期刊订阅