IEEE Infrastructure Conference

Faster Scalable ML Model Deployment Using ONNX and Open Source Tools



Abstract

Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. As ML development shifts from research to real-world applications, many deployment challenges arise. Teams may experiment with various training frameworks while targeting deployments across multiple platforms and hardware. Training with one framework for one hardware target is easy to manage, but a matrix of multiple frameworks and deployment targets quickly becomes challenging. This fragmented ecosystem introduces deployment complexity, and custom code is often needed to maximize performance for each scenario, which is time-consuming to maintain as models are updated. To streamline this, the interoperable ONNX model format and the ONNX Runtime inference engine can be used to deploy models efficiently across a variety of hardware. Models trained with PyTorch, TensorFlow, scikit-learn, CoreML, and more can all be converted to the common ONNX format, and the converted model can then be run with the cross-platform, performance-focused ONNX Runtime inference engine, which supports a range of hardware options for acceleration across CPUs and GPUs. ONNX Runtime is already used in key Microsoft services, realizing a 2x performance improvement on average. In this session, we share an overview of ONNX Runtime, present success stories and usage examples from high-volume product groups at Microsoft, and demonstrate ways to integrate it into your AI workflows for immediate impact.
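The export-then-infer workflow the abstract describes can be sketched in a few lines of Python. The snippet below is a minimal illustration, not the presenters' own code: it assumes the torch, numpy, and onnxruntime packages are installed, and the toy two-layer network stands in for any trained model.

```python
import numpy as np
import torch
import torch.nn as nn
import onnxruntime as ort

# A toy model standing in for any trained PyTorch network.
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
model.eval()

# Export the model to the interoperable ONNX format.
dummy_input = torch.randn(1, 4)
torch.onnx.export(model, dummy_input, "model.onnx",
                  input_names=["input"], output_names=["output"])

# Load the exported model with ONNX Runtime. The execution provider
# selects the hardware backend: CPU here, while e.g.
# "CUDAExecutionProvider" would enable GPU acceleration.
session = ort.InferenceSession("model.onnx",
                               providers=["CPUExecutionProvider"])

# Run inference on an input of the same shape as the dummy input.
x = np.random.randn(1, 4).astype(np.float32)
outputs = session.run(None, {"input": x})
print(outputs[0].shape)  # (1, 2)
```

Because the exported ONNX file is self-contained, the same model can be served on different hardware simply by choosing a different execution provider, with no changes to the training code.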
