【24h】

Towards MLOps: A Case Study of ML Pipeline Platform

机译:朝MLOPS:毫克管道平台的案例研究

获取原文

摘要

The development and deployment of machine learning (ML) applications differ significantly from traditional applications in many ways, which have led to an increasing need for efficient and reliable production of ML applications and supported infrastructures. Though platforms such as TensorFlow Extended (TFX), ModelOps, and Kubeflow have provided end-to-end lifecycle management for ML applications by orchestrating its phases into multistep ML pipelines, their performance is still uncertain. To address this, we built a functional ML platform with DevOps capability from existing continuous integration (CI) or continuous delivery (CD) tools and Kubeflow, constructed and ran ML pipelines to train models with different layers and hyperparameters while time and computing resources consumed were recorded. On this basis, we analyzed the time and resource consumption of each step in the ML pipeline, explored the consumption concerning the ML platform and computational models, and proposed potential performance bottlenecks such as GPU utilization. Our work provides a valuable reference for ML pipeline platform construction in practice.
机译:机器学习(ML)应用的开发和部署在许多方面的传统应用中显着不同,导致越来越需要高效可靠地生产ML应用和支持的基础架构。虽然诸如Tensorflow扩展(TFX),Modelops和Kubeflow等平台,但通过将其阶段向MultiSep ML管道进行协调,虽然为ML应用程序提供了端到端的生命周期管理,但它们的性能仍然不确定。为了解决此问题,我们建立了一个功能型ML平台,具有来自现有的连续集成(CI)或连续交付(CD)工具和Kubeflow,构造和运行ML管道的功能ML平台,以培训具有不同层和超参数的模型,而在消耗的时间和计算资源记录。在此基础上,我们分析了ML管道中每一步的时间和资源消耗,探讨了ML平台和计算模型的消费,以及所提出的潜在性能瓶颈,如GPU利用率。我们的工作在实践中提供了ML管道平台建设的宝贵参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号