首页> 外文会议>2019 56th ACM/IEEE Design Automation Conference >ReForm: Static and Dynamic Resource-Aware DNN Reconfiguration Framework for Mobile Device
【24h】

ReForm: Static and Dynamic Resource-Aware DNN Reconfiguration Framework for Mobile Device

机译:ReForm:用于移动设备的静态和动态资源感知DNN重新配置框架

获取原文
获取原文并翻译 | 示例

摘要

Although the Deep Neural Network (DNN) technique has been widely applied in various applications, the DNN-based applications are still too computationally intensive for the resource-constrained mobile devices. Many works have been proposed to optimize the DNN computation performance, but most of them are limited in an algorithmic perspective, ignoring certain computing issues in practical deployment. To achieve the comprehensive DNN performance enhancement in practice, the expected DNN optimization works should closely cooperate with specific hardware and system constraints (i.e. computation capacity, energy cost, memory occupancy, and inference latency). Therefore, in this work, we propose ReForm-a resource-aware DNN optimization framework. Through thorough mobile DNN computing analysis and innovative model reconfiguration schemes (i.e. ADMM based static model fine-tuning, dynamically selective computing), ReForm can efficiently and effectively recon Figure a pre-trained DNN model for practical mobile deployment with regards to various static and dynamic computation resource constraints. Experiments show that ReForm has $sim 3.5imes$ faster optimization speed than state-of-the-art resource-aware optimization method. Also, ReForm can effective recon Figure a DNN model to different mobile devices with distinct resource constraints. Moreover, ReForm achieves satisfying computation cost reduction with ignorable accuracy drop in both static and dynamic computing scenarios (at most 18% workload, 16.23% latency, 48.63% memory, and 21.5% energy enhancement).
机译:尽管深度神经网络(DNN)技术已广泛应用于各种应用程序中,但对于资源受限的移动设备,基于DNN的应用程序在计算上仍然过于密集。已经提出了许多工作来优化DNN计算性能,但是大多数工作都在算法角度上受到限制,而忽略了实际部署中的某些计算问题。为了在实践中实现全面的DNN性能增强,预期的DNN优化工作应与特定的硬件和系统约束条件(例如计算能力,能源成本,内存占用和推理延迟)紧密配合。因此,在这项工作中,我们提出了ReForm-一种资源感知的DNN优化框架。通过彻底的移动DNN计算分析和创新的模型重新配置方案(即基于ADMM的静态模型微调,动态选择性计算),ReForm可以针对各种静态和动态情况有效地有效地重构预先训练的DNN模型,以用于实际的移动部署计算资源约束。实验表明,ReForm的优化速度比最新的资源感知优化方法快$ \ sim 3.5 \\倍。同样,ReForm可以有效地将DNN模型重构为具有不同资源限制的不同移动设备。此外,ReForm在静态和动态计算方案中(可实现最多18%的工作量,16.23%的等待时间,48.63%的内存和21.5%的能耗提升)都可实现令人满意的降低了的计算成本,而准确性却明显下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号