IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Approximate Computing for Long Short Term Memory (LSTM) Neural Networks


Abstract

Long Short Term Memory (LSTM) networks are a class of recurrent neural networks that are widely used for machine learning tasks involving sequences, including machine translation, text generation, and speech recognition. Large-scale LSTMs, which are deployed in many real-world applications, are highly compute-intensive. To address this challenge, we propose AxLSTM, an application of approximate computing to improve the execution efficiency of LSTMs. An LSTM is composed of cells, each of which contains a cell state along with multiple gating units that control the addition and removal of information from the state. The LSTM execution proceeds in timesteps, with a new symbol of the input sequence processed at each timestep. AxLSTM consists of two techniques: Dynamic Timestep Skipping (DTS) and Dynamic State Reduction (DSR). DTS identifies, at runtime, input symbols that are likely to have little or no impact on the cell state and skips evaluating the corresponding timesteps. In contrast, DSR reduces the size of the cell state in accordance with the complexity of the input sequence, leading to a reduced number of computations per timestep. We describe how AxLSTM can be applied to the most common application of LSTMs, viz., sequence-to-sequence learning. We implement AxLSTM within the TensorFlow deep learning framework and evaluate it on three state-of-the-art sequence-to-sequence models. On a 2.7 GHz Intel Xeon server with 128 GB memory and 32 processor cores, AxLSTM achieves 1.08×-1.31× speedups with minimal loss in quality, and 1.12×-1.37× speedups when moderate reductions in quality are acceptable.
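
The abstract's description of an LSTM cell (a cell state plus gating units that control what information is added to and removed from that state) corresponds to the standard LSTM update, sketched below in NumPy. This is an illustrative sketch only, not the paper's TensorFlow implementation; the stacked weight layout is an assumption made for the illustration.

# One LSTM timestep: the gating units decide what to forget from the cell
# state and what new information to add, as described in the abstract.
# Assumed layout: the four gate blocks (input, forget, candidate, output)
# are stacked row-wise in W, U, and b.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    # x: input symbol (D,), h: hidden state (H,), c: cell state (H,)
    # W: (4H, D), U: (4H, H), b: (4H,)
    z = W @ x + U @ h + b
    i, f, g, o = np.split(z, 4)
    i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)   # gate activations in [0, 1]
    g = np.tanh(g)                                 # candidate values
    c_new = f * c + i * g        # remove (forget) old information, add new
    h_new = o * np.tanh(c_new)
    return h_new, c_new

The per-timestep cost is dominated by the matrix-vector products with W and U, which is what the two AxLSTM techniques reduce: DTS by evaluating fewer timesteps, and DSR by shrinking the state those products operate on.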
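
As a rough sequence-level illustration of the two techniques, the sketch below skips timesteps whose input symbol is judged low-impact (in the spirit of DTS) and slices the gate weights to run with a smaller cell state (in the spirit of DSR). The skip proxy, the threshold, and the slicing scheme are assumptions for illustration; the abstract does not specify the paper's actual runtime criteria.

# Illustrative sketches of Dynamic Timestep Skipping (DTS) and Dynamic State
# Reduction (DSR). step_fn is any per-timestep update, e.g. the lstm_step
# sketch above; the heuristics here are assumptions, not the paper's methods.
import numpy as np

def run_with_dts(xs, h, c, step_fn, params, skip_threshold=0.05):
    # DTS (illustrative): if a cheap proxy suggests the input symbol would
    # barely change the cell state, carry (h, c) over and skip the timestep.
    skipped = 0
    for x in xs:
        if np.linalg.norm(x) < skip_threshold * (np.linalg.norm(c) + 1e-8):
            skipped += 1          # low-impact symbol: state reused unchanged
            continue
        h, c = step_fn(x, h, c, *params)
    return h, c, skipped

def reduce_state(W, U, b, hidden_size, reduced_size):
    # DSR (illustrative): keep only the first reduced_size units of each gate
    # block, so every remaining timestep runs on a smaller cell state and
    # does proportionally fewer multiply-accumulates.
    k = reduced_size
    rows = np.concatenate([np.arange(g * hidden_size, g * hidden_size + k)
                           for g in range(4)])
    return W[rows], U[rows][:, :k], b[rows]

With lstm_step as step_fn and (W, U, b) as params, skipped timesteps cost only the proxy check, and a reduced weight set shrinks each timestep's matrix-vector work from roughly 4H(D + H) to 4k(D + k) multiply-accumulates.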
