IEEE International Conference on Acoustics, Speech and Signal Processing

Simplifying long short-term memory acoustic models for fast training and decoding



Abstract

In acoustic modeling, recurrent neural networks (RNNs) with Long Short-Term Memory (LSTM) units have recently been shown to outperform deep neural network (DNN) models. This paper addresses two challenges faced by LSTM models: high model complexity and poor decoding efficiency. Motivated by our analysis of gate activations and functions, we present two LSTM simplifications: deriving the input gates from the forget gates, and removing the recurrent inputs from the output gates. To accelerate LSTM decoding, we propose applying frame skipping during training, and frame skipping with posterior copying (FSPC) during decoding. In our experiments, the model simplifications reduce the size of LSTM models by 26%, yielding a simpler model structure. Meanwhile, FSPC speeds up model computation by a factor of 2 during LSTM decoding. All these improvements are achieved at the cost of a 1% WER degradation.
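The two gate simplifications can be stated compactly: the input gate is tied to the forget gate as i_t = 1 - f_t, and the output gate drops its recurrent term. Below is a minimal NumPy sketch of the resulting cell under those assumptions; the class name, weight shapes, and initialization are illustrative, and baseline details from the paper (e.g., peephole connections or projection layers) are omitted:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class SimplifiedLSTMCell:
    """LSTM cell sketch with the two simplifications from the abstract:
    (1) the input gate is derived from the forget gate, i_t = 1 - f_t;
    (2) the output gate takes no recurrent input h_{t-1}."""

    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = np.random.default_rng(seed)
        def mat(rows, cols):
            return 0.1 * rng.standard_normal((rows, cols))
        # Forget gate keeps both input and recurrent connections.
        self.W_f, self.U_f = mat(hidden_dim, input_dim), mat(hidden_dim, hidden_dim)
        self.b_f = np.zeros(hidden_dim)
        # Cell candidate keeps both connections, as in a standard LSTM.
        self.W_c, self.U_c = mat(hidden_dim, input_dim), mat(hidden_dim, hidden_dim)
        self.b_c = np.zeros(hidden_dim)
        # Output gate: input connection only; no recurrent matrix U_o.
        self.W_o = mat(hidden_dim, input_dim)
        self.b_o = np.zeros(hidden_dim)
        # No W_i/U_i at all: the input gate is tied to the forget gate.

    def step(self, x_t, h_prev, c_prev):
        f_t = sigmoid(self.W_f @ x_t + self.U_f @ h_prev + self.b_f)
        i_t = 1.0 - f_t                           # simplification (1)
        g_t = np.tanh(self.W_c @ x_t + self.U_c @ h_prev + self.b_c)
        c_t = f_t * c_prev + i_t * g_t
        o_t = sigmoid(self.W_o @ x_t + self.b_o)  # simplification (2)
        h_t = o_t * np.tanh(c_t)
        return h_t, c_t
```

Removing the input-gate matrices and the output gate's recurrent matrix is what accounts for the reduction in parameter count relative to a standard LSTM layer of the same size.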
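FSPC at decoding time can be sketched as follows. The helper and model names here are hypothetical stand-ins (`acoustic_model` represents one forward pass of the LSTM acoustic model on a frame), not the paper's API:

```python
def decode_with_fspc(frames, acoustic_model, skip=2):
    """Frame skipping and posterior copying (FSPC) at decoding time:
    run the acoustic model only on every `skip`-th frame and copy that
    posterior vector onto the skipped frames in between, so the decoder
    still receives a posterior for every frame."""
    posteriors = []
    last = None
    for t, frame in enumerate(frames):
        if t % skip == 0:
            last = acoustic_model(frame)  # full LSTM forward pass
        posteriors.append(last)           # reused on skipped frames
    return posteriors
```

With `skip=2` the model is evaluated on half of the frames, which is consistent with the reported 2x speedup in model computation during decoding.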
