...
首页> 外文期刊>EPJ Web of Conferences >Exploring the self-service model to visualize the results of the ATLAS Machine Learning analysis jobs in BigPanDA with Openshift OKD3
【24h】

Exploring the self-service model to visualize the results of the ATLAS Machine Learning analysis jobs in BigPanDA with Openshift OKD3

机译:探索自助式模型,可视化openshift OKD3的Bigpanda中的阿特拉斯机器学习分析作业的结果

获取原文
           

摘要

A large scientific computing infrastructure must offer versatility to host any kind of experiment that can lead to innovative ideas. The ATLAS experiment offers wide access possibilities to perform intelligent algorithms and analyze the massive amount of data produced in the Large Hadron Collider at CERN. The BigPanDA monitoring is a component of the PanDA (Production ANd Distributed Analysis) system, and its main role is to monitor the entire lifecycle of a job/task running in the ATLAS Distributed Computing infrastructure. Because many scientific experiments now rely upon Machine Learning algorithms, the BigPanDA community desires to expand the platform’s capabilities and fill the gap between Machine Learning processing and data visualization. In this regard, BigPanDA partially adopts the cloud-native paradigm and entrusts the data presentation to MLFlow services running on Openshift OKD. Thus, BigPanDA interacts with the OKD API and instructs the containers orchestrator how to locate and expose the results of the Machine Learning analysis. The proposed architecture also introduces various DevOps-specific patterns, including continuous integration for MLFlow middleware configuration and continuous deployment pipelines that implement rolling upgrades. The Machine Learning data visualization services operate on demand and run for a limited time, thus optimizing the resource consumption.
机译:大型科学计算基础设施必须提供多功能性,以举办可能导致创新思想的任何实验。 ATLAS实验提供了广泛的访问可能性,可以进行智能算法,并分析CERN的大型特罗龙撞机中产生的大量数据。 BigPanda Monitoring是熊猫(生产和分布式分析)系统的组成部分,其主要作用是监控在Atlas分布式计算基础架构中运行的作业/任务的整个生命周期。由于许多科学实验现在依赖于机器学习算法,因此BigPanda社区希望扩展平台的功能并填补机器学习处理和数据可视化之间的差距。在这方面,BigPanda部分采用云原生范例,并将数据呈现委托到运行OpenShift OKD上的MLFlow服务。因此,BigPanda与OKD API交互,并指示容器协调器如何定位和公开机器学习分析的结果。该建议的架构还介绍了各种专用模式,包括持续集成MLFLOF中间件配置和实现滚动升级的连续部署管道。机器学习数据可视化服务按需运行并在有限的时间内运行,从而优化资源消耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号