首页> 外文会议>Sriwijaya International Conference on Information Technology and Its Applications >Accelerating Convolutional Neural Network Training for Colon Histopathology Images by Customizing Deep Learning Framework
【24h】

Accelerating Convolutional Neural Network Training for Colon Histopathology Images by Customizing Deep Learning Framework

机译:通过定制深层学习框架加速结肠组织病理学图像的卷积神经网络训练

获取原文

摘要

Cancer diagnose based on the histopathology images is still have some challenges. Convolutional Neural Network (CNN) is one of deep learning architecture that has widely used in medical image processing especially for cancer detection. The high resolution of images and complexity of CNN architecture causes cost-intensive in the training process. One way of reducing the training processes time is by introducing parallel processing. Graphics Processing Unit (GPU) is a graphics card which has many processors and has been widely used to speed-up the process. However, the problem in GPU is the limitation of memory size. Therefore, this study proposes alternative ways to utilize the GPU memory in the training of CNN architecture. Theano is one of middle-level framework for deep application. GPU memory is a critical task in training activity and will affect to the number of batch-size. Customizing memory allocation in Theano can be conducted by utilizing library called 'cnmem'. For training CNN architecture, we use NVIDIA GTX-980 that accelerated by customizing CUDA memory allocation from 'cnmem' library located in 'theanorc' file. In the experiment, the parameter of cnmem are chosen between 0 (not apply cnmem) or 1 (apply cnmem). We use image variation from 32x32, 64x64, 128x128, 180x180 and 200x200 pixels. In the training, a number of batch-size is selected experimentally from 10, 20, 50, 100 and 150 images. Our experiments show that enabling cnmem with the value of 1 will increase the speed-up. The 200x200 images show the most significant efficiency of GPU performance when training CNN. Speed-up is measured by comparing training time of GTX-980 with CPU core i7 machine from 16, 8, 4, 2 cores and the single-core. The highest speed-up GTX-980 obtained with enabling cnmem are 4.49, 5.00, 7.58, 11,97 and 16.19 compare to 16, 8, 4, 2 and 1 core processor respectively.
机译:基于组织病理学图像的癌症诊断仍然存在一些挑战。卷积神经网络(CNN)是深度学习架构之一,广泛用于医学图像处理,特别是对于癌症检测。高分辨率的CNN架构的图像和复杂性导致培训过程中的成本密集。减少训练过程的一种方法是通过引入并行处理。图形处理单元(GPU)是一个图形卡,具有许多处理器,并且已被广泛用于加速过程。但是,GPU中的问题是内存大小的限制。因此,本研究提出了在CNN架构训练中使用GPU存储器的替代方式。 Theano是深度应用的中级框架之一。 GPU内存是培训活动中的关键任务,并将影响批量大小的数量。可以通过利用名为“CNMEM”的库来进行自定义内存分配。对于培训CNN架构,我们使用NVIDIA GTX-980通过定制位于'theanorc'文件中的“CNMEM”库的CUDA内存分配来加速。在实验中,CNMEM的参数在0(不适用CNMEM)之间或1(应用CNMEM)之间。我们使用32x32,64x64,128x128,180x180和200x200像素的图像变型。在训练中,从10,20,50,100和150图像实验选择许多批量大小。我们的实验表明,使用值1的CNMEM将增加加速。 200x200图像显示CNN时,GPU性能最大效率。通过将GTX-980的训练时间与来自16,8,4,2个核和单芯的CPU核心I7机器进行比较来测量加速。通过启用CNMEM获得的最高加速GTX-980为4.49,5.00,7.58,11,97和16.19分别比较为16,8,4,2和1个核心处理器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号