首页> 外文会议>International Workshops on ISC High Performance >Converging HPC, Big Data and Cloud Technologies for Precision Agriculture Data Analytics on Supercomputers
【24h】

Converging HPC, Big Data and Cloud Technologies for Precision Agriculture Data Analytics on Supercomputers

机译:在超级计算机上为精密农业数据分析进行HPC,大数据和云技术

获取原文

摘要

The convergence of HPC and Big Data along with the influence of Cloud are playing an important role in the democratization of HPC. The increasing needs of Data Analytics in computational power has added new fields of interest for the HPC facilities but also new problematics such as interoperability with Cloud and ease of use. Besides the typical HPC applications, these infrastructures are now asked to handle more complex workflows combining Machine Learning, Big Data and HPC. This brings challenges on the resource management, scheduling and environment deployment layers. Hence, enhancements are needed to allow multiple frameworks to be deployed under common system management while providing the right abstraction to facilitate adoption. This paper presents the architecture adopted for the parallel and distributed execution management software stack of Cybele EU funded project which is put in place on production HPC centers to execute hybrid data analytics workflows in the context of precision agriculture and livestock farming applications. The design is based on: Kubernetes as a higher level orchestrator of Big Data components, hybrid workflows and a common interface to submit HPC or Big Data jobs; Slurm or Torque for HPC resource management; and Singularity containeriza-tion platform for the dynamic deployment of the different Data Analytics frameworks on HPC. The paper showcases precision agriculture workflows being executed upon the architecture and provides some initial performance evaluation results and insights for the whole prototype design.
机译:HPC和大数据的融合以及云的影响在HPC的民主化中发挥着重要作用。计算能力中数据分析的日益增长的需求增加了HPC设施的新兴趣领域,而且还为具有云和易用性的互操作性等新的问题。除了典型的HPC应用程序,这些基础架构现在被要求处理更复杂的工作流组合机器学习,大数据和HPC。这为资源管理,调度和环境部署层带来了挑战。因此,需要增强,以允许在公共系统管理下部署多个框架,同时提供正确的抽象以促进采用。本文介绍了Cybele欧盟资助项目的并行和分布式执行管理软件堆栈所采用的架构,该项目置于生产HPC中心,以在精密农业和牲畜养殖应用中执行混合数据分析工作流。该设计基于:Kubernetes作为大数据组件,混合工作流和常用接口的更高级别的协调,以提交HPC或大数据作业;用于HPC资源管理的Slurm或扭矩;和奇点Containeriza-Tion平台,用于HPC上不同数据分析框架的动态部署。本文展示了在架构上执行的精确农业工作流程,并为整个原型设计提供了一些初始性能评估结果和见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号