Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

Yue Wang; Jianghao Shen; Ting-Kuei Hu; Pengfei Xu; Tan Nguyen; Richard Baraniuk; Zhangyang Wang; Yingyan Lin

首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

【24h】

Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

机译：双动态推理：启用更高效，自适应和可控的深度推理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

State-of-the-art convolutional neural networks (CNNs) yield record-breaking predictive performance, yet at the cost of high-energy-consumption inference, that prohibits their widely deployments in resource-constrained Internet of Things (IoT) applications. We propose a dual dynamic inference (DDI) framework that highlights the following aspects: 1) we integrate both input-dependent and resource-dependent dynamic inference mechanisms under a unified framework in order to fit the varying IoT resource requirements in practice. DDI is able to both constantly suppress unnecessary costs for easy samples, and to halt inference for all samples to meet hard resource constraints enforced; 2) we propose a flexible multi-grained learning to skip (MGL2S) approach for input-dependent inference which allows simultaneous layer-wise and channel-wise skipping; 3) we extend DDI to complex CNN backbones such as DenseNet and show that DDI can be applied towards optimizing any specific resource goals including inference latency and energy cost. Extensive experiments demonstrate the superior inference accuracy-resource trade-off achieved by DDI, as well as the flexibility to control such a trade-off as compared to existing peer methods. Specifically, DDI can achieve up to 4 times computational savings with the same or even higher accuracy as compared to existing competitive baselines.

机译：最先进的卷积神经网络（CNNS）产量记录破坏预测性能，但在高能耗推理的成本上，禁止其在资源受限的内容（IOT）应用程序中广泛部署。我们提出了一种双动态推理（DDI）框架，突出了以下几个方面：1）我们在统一的框架下集成了输入相关和资源相关的动态推断机制，以便在实践中符合不同的物联网资源要求。 DDI能够持续抑制不必要的成本，以便容易采样，并停止所有样本的推理，以满足强制执行的硬资源限制; 2）我们提出了一种灵活的多粒度学习，用于跳过（MGL2S）方法，用于输入依赖性推断，其允许同时层和通道跳闸; 3）我们将DDI扩展到复杂的CNN骨架，如DENSENET，并显示DDI可以应用于优化包括推理延迟和能量成本的任何特定资源目标。广泛的实验证明了DDI实现的卓越推理精度资源折衷，以及与现有对等方法相比控制这种权衡的灵活性。具体而言，与现有竞争性基线相比，DDI最多可以通过相同或甚至更高的准确度实现高达4倍的计算节省。

著录项

来源
《Selected Topics in Signal Processing, IEEE Journal of》 |2020年第4期|623-633|共11页
作者
Yue Wang; Jianghao Shen; Ting-Kuei Hu; Pengfei Xu; Tan Nguyen; Richard Baraniuk; Zhangyang Wang; Yingyan Lin;
展开▼
作者单位

Department of Electrical and Computer Engineering Rice University Houston TX USA;

Department of Electrical and Computer Engineering Rice University Houston TX USA;

Department of Computer Science and Engineering Texas A&M University College Station TX USA;

Department of Electrical and Computer Engineering Rice University Houston TX USA;

Department of Electrical and Computer Engineering Rice University Houston TX USA;

Department of Electrical and Computer Engineering Rice University Houston TX USA;

Department of Computer Science and Engineering Texas A&M University College Station TX USA;

Department of Electrical and Computer Engineering Rice University Houston TX USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Dynamic scheduling; Adaptation models; Complexity theory; Computational modeling; Performance evaluation; Internet of Things; Training;

机译：动态调度;适应模型;复杂性理论;计算建模;绩效评估;事物互联网;培训;

相似文献

外文文献
中文文献
专利

1. Efficient single and dual axis solar tracking system controllers based on adaptive neural fuzzy inference system [J] . Nadia AL-Rousan, Nor Ashidi Mat Isa, Mohd Khairunaz Mat Desa Journal of King Saud University-Engineering Sciences . 2020,第7期

机译：基于自适应神经模糊推理系统的高效单轴和双轴太阳能跟踪系统控制器
2. Efficient single and dual axis solar tracking system controllers based on adaptive neural fuzzy inference system [J] . Nadia AL-Rousan, Nor Ashidi Mat Isa, Mohd Khairunaz Mat Desa Journal of King Saud University: Physics and Mathematics . 2020,第7期

机译：基于自适应神经模糊推理系统的高效单轴和双轴太阳能跟踪系统控制器
3. Efficient adaptive inference for deep convolutional neural networks using hierarchical early exits [J] . Passalis Nikolaos, Raitoharju Jenni, Tefas Anastasios, Pattern Recognition: The Journal of the Pattern Recognition Society . 2020,第期

机译：使用分层早期出口的深度卷积神经网络有效的自适应推断
4. LVRT Capabilities of Solar Energy Conversion System Enabling Power Quality Improvement Particle Swarm Optimization based Adaptive Neuro-Fuzzy Inference System for MPPT Control of a Three-Phase Grid-Connected Photovoltaic System [C] . Priyank Shah, Bhim Singh IEEE International Electric Machines and Drives Conference . 2019

机译：太阳能转换系统的LVRT能力能够实现电力质量改进粒子群优化的三相网格连接光伏系统MPPT控制的自适应神经模糊推理系统
5. Reshaping Deep Neural Networks for Efficient Hardware Inference [D] . Khodamoradi, Alireza. 2021

机译：重塑深神经网络以实现高效硬件推理
6. Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing [O] . Md Jubaer Hossain Pantho, Pankaj Bhowmik, Christophe Bobda 2021

机译：迈向有效的CNN推理架构实现了传感器处理
7. Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference [O] . Yue Wang, Jianghao Shen, Ting-Kuei Hu, 2020

机译：双动态推理：启用更高效，自适应和可控的深度推理

Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

摘要

著录项

相似文献

相关主题

期刊订阅