Conference proceedings: High Performance Computing

Machine Learning Using Virtualized GPUs in Cloud Environments



Abstract

Using graphics processing units (GPUs) to accelerate machine learning applications has become a focus of high performance computing (HPC) in recent years. In cloud environments, many cloud-based GPU solutions have been introduced to use GPU resources seamlessly and securely without sacrificing their performance benefits. Two main approaches stand out: direct pass-through technologies available on hypervisors, and virtual GPU technologies introduced by GPU vendors. In this paper, we present a performance study of these two GPU virtualization solutions for machine learning in the cloud. We evaluate the advantages and disadvantages of each solution and report new findings on their performance impact on machine learning applications in different real-world use-case scenarios. We also examine the benefits of virtual GPUs both for machine learning alone and for machine learning applications running alongside other GPU-based applications, such as 3D graphics, on the same multi-GPU server to better utilize computing resources. Based on experimental results from benchmarking machine learning applications developed with TensorFlow, we discuss scaling from one to multiple GPUs and compare the performance of the two virtual GPU solutions. Finally, we show that mixing machine learning with other GPU-based workloads can reduce combined execution time compared to running these workloads sequentially.
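The abstract's final claim rests on a general scheduling principle: two workloads that each leave the device idle part of the time can overlap, so running them concurrently finishes sooner than running them back-to-back. A toy illustration in Python (sleep-based stand-ins for GPU phases; purely illustrative, not the authors' benchmark setup):

```python
import threading
import time

def workload(duration: float) -> None:
    # Stand-in for a GPU job: the simulated device is busy for `duration` seconds.
    time.sleep(duration)

# Sequential: run the ML job, then the graphics job, one after the other.
start = time.perf_counter()
workload(0.2)
workload(0.2)
sequential = time.perf_counter() - start

# Mixed: launch both at once so their busy periods overlap.
start = time.perf_counter()
threads = [threading.Thread(target=workload, args=(0.2,)) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
combined = time.perf_counter() - start

print(f"sequential: {sequential:.2f}s, combined: {combined:.2f}s")
```

On real hardware the gain depends on how well the two workloads' resource demands (compute, memory bandwidth, framebuffer) complement each other, which is what the paper measures.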
