首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing >Predicting Resource Requirement in Intermediate Palomar Transient Factory Workflow
【24h】

Predicting Resource Requirement in Intermediate Palomar Transient Factory Workflow

机译:预测中级Palomar临时工厂工作流程中的资源需求

获取原文

摘要

Quickly identifying astronomical transients from synoptic surveys is critical to many recent astrophysical discoveries. However, each of the data processing pipelines in these surveys contains dozens of stages with highly varying time and space requirements. Properly predicting the resources required to run these pipelines is critical for the allocation of computing resources and reducing the discovery response time. We propose a machine learning strategy for this prediction task and demonstrate its effectiveness using a set of timing measurements from the intermediate Palomar Transient Factory (iPTF) workflow. The proposed model utilizes the spatiotemporal correlation of astronomical images, where nearby patches of the sky (space) are likely to have a similar number of objects of interest and workflows executed in the recent past (time) are likely to use a similar amount of time because the machines and data storage systems are likely to be in similar states. We capture the relationship among these spatial and temporal features in a Bayesian network and study how they impact the prediction accuracy. This Bayesian network helps us to identify the most influential features for predictions. With proper features, our models achieve errors close to the random variance boundary within batches of images taken at the same time, which can be regarded as the intrinsic limit of prediction accuracy.
机译:从天气调查中快速识别天文瞬变对于许多最近的天文学发现是至关重要的。但是,这些调查中的每个数据处理管道都包含数十个阶段,这些阶段具有很大的时间和空间要求。正确预测运行这些管道所需的资源对于分配计算资源和减少发现响应时间至关重要。我们针对此预测任务提出了一种机器学习策略,并使用来自中间Palomar瞬态工厂(iPTF)工作流程的一组时序测量来证明其有效性。所提出的模型利用了天文图像的时空相关性,其中附近的天空(空间)斑块可能具有相似数量的关注对象,并且在最近的过去(时间)执行的工作流可能使用了相似的时间量。因为机器和数据存储系统可能处于相似状态。我们捕获贝叶斯网络中这些时空特征之间的关系,并研究它们如何影响预测准确性。贝叶斯网络帮助我们确定最有影响力的预测特征。通过适当的功能,我们的模型在同时拍摄的一批图像中实现了接近随机方差边界的误差,这可以被视为预测准确性的内在极限。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号