IEEE International Conference on Acoustics, Speech and Signal Processing

Extended low-rank plus diagonal adaptation for deep and recurrent neural networks

Abstract

Recently, the low-rank plus diagonal (LRPD) adaptation was proposed for speaker adaptation of deep neural network (DNN) models. LRPD restructures the adaptation matrix as the superposition of a diagonal matrix and a product of two low-rank matrices. In this paper, we extend LRPD adaptation into a subspace-based approach to further reduce the speaker-dependent (SD) footprint. We apply the extended LRPD (eLRPD) adaptation to DNN and LSTM models, with emphasis on the applicability of the adaptation to large-scale speech recognition systems. To speed up adaptation at test time, we propose a bottleneck (BN) caching approach that eliminates redundant computations during multiple sweeps of the development data. Experimental results on a short message dictation (SMD) task show that eLRPD adaptation can reduce the SD footprint by 82% for the SVD DNN and by 96% for the LSTM-RNN relative to linear adaptation, while maintaining comparable accuracy. BN caching achieves up to a 3.5-times speedup in adaptation with no loss of recognition accuracy.
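The abstract describes the LRPD restructuring only at a high level: the adaptation matrix is parameterized as a diagonal matrix plus a product of two low-rank factors. The sketch below is a minimal NumPy illustration of that idea, W = diag(d) + P·Q with rank r much smaller than the layer width k; the sizes, initialization, and function name are illustrative assumptions, not values from the paper.

```python
import numpy as np

def lrpd_transform(x, d, P, Q):
    """Apply an LRPD-restructured linear adaptation to activations x.

    The k x k adaptation matrix is never formed explicitly; it is
    parameterized as W = diag(d) + P @ Q, where P is (k, r) and
    Q is (r, k) with r << k, so only k + 2*k*r values are stored.
    """
    return d * x + (x @ Q.T) @ P.T   # equals x @ W.T with W = diag(d) + P @ Q

k, r = 2048, 8                       # hidden width and low rank (illustrative)
rng = np.random.default_rng(0)

# Identity start (d = 1, low-rank part = 0): before any speaker data is
# seen, the adapted layer reproduces the speaker-independent model exactly.
d = np.ones(k)
P = np.zeros((k, r))
Q = rng.standard_normal((r, k)) * 0.01

x = rng.standard_normal((4, k))      # a mini-batch of hidden activations
y = lrpd_transform(x, d, P, Q)

# Footprint comparison: full linear adaptation vs LRPD.
full_params = k * k
lrpd_params = k + 2 * k * r
print(f"full: {full_params:,}  lrpd: {lrpd_params:,}  "
      f"reduction: {1 - lrpd_params / full_params:.1%}")
```

With these toy sizes the per-speaker parameter count drops from roughly 4.2 M to about 35 K, which is the kind of footprint reduction the eLRPD work pushes further by constraining the factors to a shared subspace.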
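The abstract characterizes BN caching only as eliminating redundant computation across multiple sweeps of the development data. The sketch below shows one plausible reading, assuming the speaker-independent (SI) layers below the adapted transform stay frozen during adaptation, so their bottleneck outputs can be computed once and reused; all names, sizes, and the update rule are hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical frozen SI stack below the adapted transform; in a real
# system this deep forward pass dominates the adaptation cost.
W_si = rng.standard_normal((2048, 512)) * 0.01

def si_forward(x):
    """Frozen SI layers up to the bottleneck; identical on every sweep."""
    return np.maximum(x @ W_si, 0.0)

# One pass over the development data fills the cache ...
dev_batches = [rng.standard_normal((64, 2048)) for _ in range(10)]
cache = [si_forward(x) for x in dev_batches]

# ... and every later sweep reads bottleneck activations from the cache,
# so only the small SD parameters are recomputed and updated.
d = np.ones(512)                     # toy SD diagonal parameters
for sweep in range(5):
    for h in cache:                  # no SI forward pass repeated here
        y = d * h                    # cheap SD computation only
        d *= 0.999                   # placeholder for a gradient update
```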
