首页> 外文会议>IEEE International Conference on Big Data >Empowering Agroecosystem Modeling with HTC Scientific Workflows: The Cycles Model Use Case
【24h】

Empowering Agroecosystem Modeling with HTC Scientific Workflows: The Cycles Model Use Case

机译:使用HTC科学工作流程增强农业生态系统建模:周期模型用例

获取原文

摘要

Scientific workflows have enabled large-scale scientific computations and data analysis, and lowered the entry barrier for performing computations in distributed heterogeneous platforms (e.g., HTC and HPC). In spite of impressive achievements to date, large-scale modeling, simulation, and data analytics in the long-tail still face several challenges such as efficient scheduling and execution of large-scale workflows $(mathrm{O}(10^{6}))$ with very short-running tasks (few seconds). While the current trend to support next-generation workflows on leadership class machines have gained much attention in the past years, at the other end of the spectrum scientific workflows from the long-tail science have become larger and require processing massive volumes of data. In this paper, we report on our experience in designing and implementing an HTC workflow for agroecosystem modeling. We leverage well-known (task clustering and co-scheduling) and emerging (hierarchical workflows and containers) workflow optimization techniques to make the workflow planning problem tractable, and maximize resource utilization and the degree of task parallelism. Experimental results, via the implementation of a use case, show that by strategically combining the above strategies and defining an appropriate set of optimization parameters, the overall workflow makespan can be improved by 3.5 orders of magnitude when compared to a regular (non-optimized) execution of the workflow.
机译:科学工作流程实现了大规模科学计算和数据分析,并降低了在分布式异构平台(例如HTC和HPC)中执行计算的入门门槛。尽管迄今为止取得了令人瞩目的成就,但长尾的大规模建模,仿真和数据分析仍然面临一些挑战,例如有效调度和执行大规模工作流$(\ mathrm {O}(10 ^ {6 }))$具有非常短时间的任务(几秒钟)。在过去的几年中,尽管目前在领导级别机器上支持下一代工作流的趋势已引起了广泛关注,但另一方面,长尾科学的科学工作流却变得越来越大,需要处理大量数据。在本文中,我们报告了我们在设计和实施用于农业生态系统建模的HTC工作流程方面的经验。我们利用众所周知的(任务聚类和协同调度)和新兴的(分层工作流和容器)工作流优化技术,使工作流规划问题变得易于处理,并最大程度地利用资源和任务并行度。通过用例的实现,实验结果表明,与常规(未优化)相比,通过战略性地组合上述策略并定义一组适当的优化参数,可以将整个工作流程的有效期提高3.5个数量级。工作流程的执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号