International Conference on Human Computer Interactions

Multi-Model Inference Acceleration on Embedded Multi-Core Processors


Abstract

The predominant resource-efficient approaches that enable on-device inference include designing lightweight DNN architectures such as MobileNets and SqueezeNets, and compressing models with techniques such as network pruning, vector quantization, distillation, and binarization. Recent research on dynamic layer-wise partitioning and partial execution of CNN-based model inference also makes co-inference possible on devices with constrained memory and compute resources. However, each of these approaches has its own bottleneck: lightweight DNN architectures and model compression usually sacrifice accuracy to fit resource-constrained devices, while the efficiency of dynamic model partitioning depends heavily on network conditions. This paper proposes an approach for multi-model inference acceleration on heterogeneous devices. The idea is to deploy multiple single-object detection models instead of one heavy multi-object model: in most cases a scenario only requires detecting one or two objects, and a single-object detection model can be lighter at the same resolution quality and consume fewer resources. Moreover, in a cloud-edge-device scenario, a scheduler policy makes it possible to gradually update models as needed.
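The abstract only outlines the deployment idea at a high level. The following is a minimal, hypothetical sketch of that idea — several lightweight single-object detectors resident in parallel, one per worker process, on a multi-core embedded CPU. It assumes the TFLite runtime as the inference engine; the model file names, object classes, and input shape are illustrative assumptions, not details from the paper, and the paper's cloud-edge-device scheduler policy is not modeled.

```python
# Sketch: fan one input frame out to several single-object detectors,
# each running in its own process so a multi-core embedded CPU serves
# them concurrently. Assumes tflite_runtime is installed and that the
# (hypothetical) .tflite files take a 1x300x300x3 float32 input.
from concurrent.futures import ProcessPoolExecutor

import numpy as np


def detect(model_path: str, frame: np.ndarray):
    """Load one single-object detector and run it on a frame."""
    import tflite_runtime.interpreter as tflite  # assumed runtime
    # num_threads=1 keeps each detector on one core; parallelism
    # comes from running the detectors in separate processes.
    interpreter = tflite.Interpreter(model_path=model_path, num_threads=1)
    interpreter.allocate_tensors()
    inp = interpreter.get_input_details()[0]
    out = interpreter.get_output_details()[0]
    interpreter.set_tensor(inp["index"], frame[np.newaxis].astype(np.float32))
    interpreter.invoke()
    return model_path, interpreter.get_tensor(out["index"])


if __name__ == "__main__":
    # One lightweight model per object class of interest (hypothetical paths);
    # the paper argues one or two such models usually suffice per scenario.
    models = ["person.tflite", "vehicle.tflite"]
    frame = np.zeros((300, 300, 3), dtype=np.float32)  # placeholder input
    with ProcessPoolExecutor(max_workers=len(models)) as pool:
        for name, boxes in pool.map(detect, models, [frame] * len(models)):
            print(name, boxes.shape)
```

In a full system, the scheduler policy mentioned in the abstract would decide which single-object models are resident at any time and push updated weights from the cloud or edge; here `pool.map` simply dispatches the frame to every loaded detector.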
