...
首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference
【24h】

Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

机译:双动态推理:启用更高效,自适应和可控的深度推理

获取原文
获取原文并翻译 | 示例
           

摘要

State-of-the-art convolutional neural networks (CNNs) yield record-breaking predictive performance, yet at the cost of high-energy-consumption inference, that prohibits their widely deployments in resource-constrained Internet of Things (IoT) applications. We propose a dual dynamic inference (DDI) framework that highlights the following aspects: 1) we integrate both input-dependent and resource-dependent dynamic inference mechanisms under a unified framework in order to fit the varying IoT resource requirements in practice. DDI is able to both constantly suppress unnecessary costs for easy samples, and to halt inference for all samples to meet hard resource constraints enforced; 2) we propose a flexible multi-grained learning to skip (MGL2S) approach for input-dependent inference which allows simultaneous layer-wise and channel-wise skipping; 3) we extend DDI to complex CNN backbones such as DenseNet and show that DDI can be applied towards optimizing any specific resource goals including inference latency and energy cost. Extensive experiments demonstrate the superior inference accuracy-resource trade-off achieved by DDI, as well as the flexibility to control such a trade-off as compared to existing peer methods. Specifically, DDI can achieve up to 4 times computational savings with the same or even higher accuracy as compared to existing competitive baselines.
机译:最先进的卷积神经网络(CNNS)产量记录破坏预测性能,但在高能耗推理的成本上,禁止其在资源受限的内容(IOT)应用程序中广泛部署。我们提出了一种双动态推理(DDI)框架,突出了以下几个方面:1)我们在统一的框架下集成了输入相关和资源相关的动态推断机制,以便在实践中符合不同的物联网资源要求。 DDI能够持续抑制不必要的成本,以便容易采样,并停止所有样本的推理,以满足强制执行的硬资源限制; 2)我们提出了一种灵活的多粒度学习,用于跳过(MGL2S)方法,用于输入依赖性推断,其允许同时层和通道跳闸; 3)我们将DDI扩展到复杂的CNN骨架,如DENSENET,并显示DDI可以应用于优化包括推理延迟和能量成本的任何特定资源目标。广泛的实验证明了DDI实现的卓越推理精度资源折衷,以及与现有对等方法相比控制这种权衡的灵活性。具体而言,与现有竞争性基线相比,DDI最多可以通过相同或甚至更高的准确度实现高达4倍的计算节省。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号