首页> 外文会议>2012 SC Companion: High Performance Computing, Networking, Storage and Analysis. >Predicting the Execution Time of Workflow Activities Based on Their Input Features
【24h】

Predicting the Execution Time of Workflow Activities Based on Their Input Features

机译:根据输入特征预测工作流活动的执行时间

获取原文
获取原文并翻译 | 示例

摘要

The ability to accurately estimate the execution time of computationally expensive e-science algorithms enables better scheduling of workflows that incorporate those algorithms as their building blocks, and may give users an insight into the expected cost of workflow execution on cloud resources. When a large history of past runs can be observed, crude estimates such as the average execution time can easily be provided. We make the hypothesis that, for some algorithms, better estimates can be obtained by using the histories to learn regression models that predict execution time based on selected features of their inputs. We refer to this property as input predictability of algorithms. We are motivated by e-science workflows that involve repetitive training of multiple learning models. Thus, we verify our hypothesis on the specific case of the C4.5 decision tree builder, a well-known learning method whose training execution time is indeed sensitive to the specific input dataset, but in non- obvious ways. We use the case study to demonstrate a method for assessing input predictability. While this yields promising results, we also find that its more general applicability involves a trade off between the black-box nature of the algorithms under analysis, and the need for expert insight into relevant features of their inputs.
机译:能够准确估算计算上昂贵的电子科学算法的执行时间的能力可以更好地调度将这些算法作为其构建基块的工作流,并可以使用户洞悉云资源上工作流执行的预期成本。当可以观察到大量的过去运行历史时,可以轻松提供诸如平均执行时间之类的粗略估计。我们假设,对于某些算法,可以通过使用历史学习回归模型来获得更好的估计,这些模型基于输入的选定特征预测执行时间。我们将此属性称为算法的输入可预测性。我们受到涉及重复学习多种学习模型的电子科学工作流程的激励。因此,我们在C4.5决策树构建器的特定情况下验证了我们的假设,C4.5决策树构建器是一种众所周知的学习方法,其训练执行时间的确对特定输入数据集敏感,但以非显而易见的方式。我们使用案例研究来演示评估输入可预测性的方法。虽然这产生了令人鼓舞的结果,但我们还发现,它的更广泛的适用性涉及正在分析的算法的黑盒性质与需要专家了解其输入的相关特征之间的权衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号