Annual IEEE/ACM International Symposium on Microarchitecture

vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design


Abstract

The most widely used machine learning frameworks require users to carefully tune their memory usage so that the deep neural network (DNN) fits into the DRAM capacity of a GPU. This restriction hampers a researcher's flexibility to study different machine learning algorithms, forcing them to either use a less desirable network architecture or parallelize the processing across multiple GPUs. We propose a runtime memory manager that virtualizes the memory usage of DNNs such that both GPU and CPU memory can simultaneously be utilized for training larger DNNs. Our virtualized DNN (vDNN) reduces the average GPU memory usage of AlexNet by up to 89%, OverFeat by 91%, and GoogLeNet by 95%, a significant reduction in memory requirements of DNNs. Similar experiments on VGG-16, one of the deepest and most memory-hungry DNNs to date, demonstrate the memory-efficiency of our proposal. vDNN enables VGG-16 with batch size 256 (requiring 28 GB of memory) to be trained on a single NVIDIA Titan X GPU card containing 12 GB of memory, with 18% performance loss compared to a hypothetical, oracular GPU with enough memory to hold the entire DNN.
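The mechanism the abstract describes is an offload/prefetch pattern: feature maps are copied out to pinned CPU memory on a separate CUDA stream after the forward pass no longer needs them on the GPU, the GPU copy is freed, and the data is prefetched back before the corresponding backward step. Below is a minimal sketch of that round trip for a single feature map; the buffer size, stream setup, and helper names (check, offload, dev_fmap, host_fmap) are illustrative assumptions, not the paper's actual implementation or API.

```cuda
// Offload/prefetch round trip for one feature map, the pattern that a
// vDNN-style runtime memory manager applies layer by layer.
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>

static void check(cudaError_t err, const char *what) {
    if (err != cudaSuccess) {
        fprintf(stderr, "%s: %s\n", what, cudaGetErrorString(err));
        exit(1);
    }
}

int main() {
    const size_t bytes = 64u << 20;            // hypothetical 64 MB feature map
    float *dev_fmap = nullptr, *host_fmap = nullptr;

    check(cudaMalloc(&dev_fmap, bytes), "cudaMalloc");
    check(cudaMallocHost(&host_fmap, bytes), "cudaMallocHost"); // pinned => async DMA

    cudaStream_t offload;                      // runs alongside the compute stream
    check(cudaStreamCreate(&offload), "cudaStreamCreate");

    // Forward pass: once the next layer has consumed this feature map,
    // stage it out to CPU memory and release the GPU copy.
    check(cudaMemcpyAsync(host_fmap, dev_fmap, bytes,
                          cudaMemcpyDeviceToHost, offload), "offload copy");
    check(cudaStreamSynchronize(offload), "sync offload");
    check(cudaFree(dev_fmap), "cudaFree");     // GPU memory reclaimed for later layers

    // Backward pass: prefetch the feature map back before its gradient step.
    check(cudaMalloc(&dev_fmap, bytes), "cudaMalloc (prefetch)");
    check(cudaMemcpyAsync(dev_fmap, host_fmap, bytes,
                          cudaMemcpyHostToDevice, offload), "prefetch copy");
    check(cudaStreamSynchronize(offload), "sync prefetch");

    printf("offload/prefetch round trip of %zu MB complete\n", bytes >> 20);

    check(cudaFree(dev_fmap), "cudaFree");
    check(cudaFreeHost(host_fmap), "cudaFreeHost");
    check(cudaStreamDestroy(offload), "cudaStreamDestroy");
    return 0;
}
```

In the paper's setting the copies overlap with computation on other layers rather than being synchronized immediately, which is how CPU memory can stand in for GPU DRAM with only the reported 18% performance loss on VGG-16.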
