Towards MLOps: A Case Study of ML Pipeline Platform

机译：朝MLOPS：毫克管道平台的案例研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The development and deployment of machine learning (ML) applications differ significantly from traditional applications in many ways, which have led to an increasing need for efficient and reliable production of ML applications and supported infrastructures. Though platforms such as TensorFlow Extended (TFX), ModelOps, and Kubeflow have provided end-to-end lifecycle management for ML applications by orchestrating its phases into multistep ML pipelines, their performance is still uncertain. To address this, we built a functional ML platform with DevOps capability from existing continuous integration (CI) or continuous delivery (CD) tools and Kubeflow, constructed and ran ML pipelines to train models with different layers and hyperparameters while time and computing resources consumed were recorded. On this basis, we analyzed the time and resource consumption of each step in the ML pipeline, explored the consumption concerning the ML platform and computational models, and proposed potential performance bottlenecks such as GPU utilization. Our work provides a valuable reference for ML pipeline platform construction in practice.

机译：机器学习（ML）应用的开发和部署在许多方面的传统应用中显着不同，导致越来越需要高效可靠地生产ML应用和支持的基础架构。虽然诸如Tensorflow扩展（TFX），Modelops和Kubeflow等平台，但通过将其阶段向MultiSep ML管道进行协调，虽然为ML应用程序提供了端到端的生命周期管理，但它们的性能仍然不确定。为了解决此问题，我们建立了一个功能型ML平台，具有来自现有的连续集成（CI）或连续交付（CD）工具和Kubeflow，构造和运行ML管道的功能ML平台，以培训具有不同层和超参数的模型，而在消耗的时间和计算资源记录。在此基础上，我们分析了ML管道中每一步的时间和资源消耗，探讨了ML平台和计算模型的消费，以及所提出的潜在性能瓶颈，如GPU利用率。我们的工作在实践中提供了ML管道平台建设的宝贵参考。

著录项

来源
《International Conference on Artificial Intelligence and Computer Engineering》|2020年|494-500|共7页
会议地点
作者
Yue Zhou; Yue Yu; Bo Ding;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Training; Computational modeling; Pipelines; Graphics processing units; Tools; Data models; Task analysis;

机译：培训;计算建模;管道;图形处理单元;工具;数据模型;任务分析;

相似文献

外文文献
中文文献
专利

1. Analytical Platforms 3: Processing Samples via the RPPA Pipeline to Generate Large-Scale Data for Clinical Studies [J] . Siwak Doris R., Li Jun, Akbani Rehan, Advances in Experimental Medicine and Biology . 2019,第期

机译：分析平台3：通过RPPA管道处理样品，为临床研究产生大规模数据
2. RaMWAS: fast methylome-wide association study pipeline for enrichment platforms [J] . Shabalin Andrey A., Hattab Mohammad W., Clark Shaunna L., Bioinformatics . 2018,第13期

机译：RAMWAS：丰富的甲基族协会学习管道，用于富集平台
3. AGILE-ACCORD: A Randomized, Multicentre, Seamless, Adaptive Phase I/II Platform Study to Determine the Optimal Dose, Safety and Efficacy of Multiple Candidate Agents for the Treatment of COVID-19: A structured summary of a study protocol for a randomised platform trial [J] . Gareth Griffiths, Richard Fitzgerald, Thomas Jaki, Trials . 2020,第1期

机译：Agile-Accord：随机，多期中心，无缝，自适应相I / II平台研究，以确定多个候选药物治疗Covid-19的最佳剂量，安全性和功效：随机平台的研究方案的结构化概述审判
4. GEOTECHNICAL STUDY TO DESIGN A GIS PLATFORM SUBJECTED TO PIPELINE IN PERMAFROST AREA [C] . Sewon KIM, YoungSeok KIM International Conference on Ocean, Offshore and Arctic Engineering . 2020

机译：岩土学研究设计在多冻地区经过管道的GIS平台
5. Improving the Performance of Long-Running Scientific Pipelines in a Bioinformatics Pipeline Platform [D] . ?Tong, Hao 2020

机译：提高长期运行的科学管线在生物信息学管道平台性能
6. RaMWAS: fast methylome-wide association study pipeline for enrichment platforms [O] . Andrey A Shabalin, Mohammad W Hattab, Shaunna L Clark, -1

机译：RaMWAS：用于浓缩平台的快速全甲基组关联研究流程
7. An Effective Processing Pipeline for Harmonizing DNA Methylation Data from Illumina’s 450K and EPIC Platforms for Epidemiological Studies [O] . Lauren A Vanderlinden, Randi K Johnson, Patrick M Carry, 2020

机译：一种有效的处理管道，用于协调Illumina 450K和史诗平台的流行病学研究的DNA甲基化数据

Towards MLOps: A Case Study of ML Pipeline Platform

摘要

著录项

相似文献

相关主题

期刊订阅