Journal: Progress in Artificial Intelligence

Sequence-To-Sequence Neural Networks Inference on Embedded Processors Using Dynamic Beam Search



Abstract

Sequence-to-sequence deep neural networks have become the state of the art for a variety of machine learning applications, ranging from neural machine translation (NMT) to speech recognition. Many mobile and Internet of Things (IoT) applications would benefit from the ability to perform sequence-to-sequence inference directly on embedded devices, thereby reducing the amount of raw data transmitted to the cloud and yielding benefits in terms of response latency, energy consumption, and security. However, due to the high computational complexity of these models, specific optimization techniques are needed to achieve acceptable performance and energy consumption on single-core embedded processors. In this paper, we present a new optimization technique called dynamic beam search, in which the inference complexity is tuned at runtime to the difficulty of the processed input sequence. Results based on measurements on a real embedded device, and on three state-of-the-art deep learning models, show that our method is able to reduce the inference time and energy by up to 25% without loss of accuracy.
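The abstract does not specify how the paper measures input "difficulty", so the following is only a minimal sketch of the general idea: a beam search whose width is adapted per decoding step, here using the entropy of the model's next-token distribution as an assumed difficulty proxy (low entropy = confident = narrow beam, high entropy = uncertain = wide beam). The function names (`dynamic_beam_search`, `step_logits_fn`) and the entropy criterion are illustrative assumptions, not the authors' implementation.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def entropy(probs):
    """Shannon entropy (nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def dynamic_beam_search(step_logits_fn, bos, eos, max_len=20,
                        min_beam=1, max_beam=4, entropy_threshold=1.0):
    """Beam search whose width adapts to input difficulty at runtime.

    step_logits_fn(prefix) -> list of logits over the vocabulary for the
    next token given the decoded prefix (stands in for the seq2seq decoder).
    When the predictive distribution is confident (low entropy) the beam is
    narrowed toward min_beam, reducing per-step compute; when it is
    uncertain, the beam widens toward max_beam to preserve accuracy.
    """
    beams = [([bos], 0.0)]  # (token sequence, cumulative log-probability)
    for _ in range(max_len):
        candidates = []
        widths = []
        for seq, score in beams:
            if seq[-1] == eos:           # finished hypothesis: carry forward
                candidates.append((seq, score))
                continue
            probs = softmax(step_logits_fn(seq))
            # Difficulty proxy: entropy of the next-token distribution.
            width = max_beam if entropy(probs) > entropy_threshold else min_beam
            widths.append(width)
            top = sorted(range(len(probs)), key=lambda t: -probs[t])[:max_beam]
            for t in top:
                candidates.append((seq + [t], score + math.log(probs[t])))
        beam_width = max(widths) if widths else min_beam
        beams = sorted(candidates, key=lambda c: -c[1])[:beam_width]
        if all(seq[-1] == eos for seq, _ in beams):
            break
    return beams[0][0]
```

With a toy deterministic "decoder" over a 4-token vocabulary (0 = BOS, 1 = EOS) that strongly prefers the chain 0 → 2 → 3 → 1, the search collapses to a width-1 beam at every step and returns the sequence `[0, 2, 3, 1]`, illustrating how easy inputs get the cheapest possible decoding.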


