Investigation of Knowledge Transfer Approaches to Improve the Acoustic Modeling of Vietnamese ASR System

Danyang Liu; Ji Xu; Pengyuan Zhang; Yonghong Yan

Abstract

It is well known that automatic speech recognition (ASR) is a resource-intensive task. Training a state-of-the-art deep neural network acoustic model requires a large amount of data. For low-resource languages, where scripted speech is difficult to obtain, data sparsity is the main problem limiting the performance of speech recognition systems. In this paper, several knowledge transfer methods are investigated to overcome the data sparsity problem with the help of high-resource languages. The first is a pre-training and fine-tuning (PT/FT) method, in which the parameters of the hidden layers are initialized from a well-trained neural network. Second, progressive neural networks (Prognets) are investigated. Thanks to the lateral connections in their architecture, Prognets are immune to the forgetting effect and are superior at transferring knowledge. Finally, bottleneck features (BNF) are extracted using cross-lingual deep neural networks and serve as enhanced features to improve the performance of the ASR system. Experiments are conducted on a low-resource Vietnamese dataset. The results show that all three methods yield significant gains over the baseline system, and that the Prognets acoustic model performs best. Further improvements can be obtained by combining the Prognets model with bottleneck features.
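The first two transfer schemes named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the layer sizes, ReLU activation, random initialization, and all function names (`init_layer`, `pretrain_finetune_init`, `prognet_forward`) are assumptions made for illustration, and no training loop is shown. PT/FT is sketched as copying the hidden layers of a source-language network and attaching a fresh output layer; the Prognet forward pass is sketched as a frozen source column feeding a new target column through lateral connections.

```python
import random


def init_layer(n_in, n_out, rng):
    """Small random weight matrix (n_out rows of n_in weights) with a zero bias."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return {"W": weights, "b": [0.0] * n_out}


def affine(layer, x):
    """Compute W x + b for one layer."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(layer["W"], layer["b"])]


def relu(values):
    return [max(0.0, v) for v in values]


def pretrain_finetune_init(source_hidden, n_target_outputs, rng):
    """PT/FT initialization: copy the source hidden layers, add a new output layer."""
    hidden = [{"W": [row[:] for row in layer["W"]], "b": layer["b"][:]}
              for layer in source_hidden]
    n_last = len(hidden[-1]["b"])
    return hidden, init_layer(n_last, n_target_outputs, rng)


def prognet_forward(x, source_hidden, target_hidden, laterals, output_layer):
    """Prognet-style forward pass with one frozen source column.

    Target layer i receives the previous target activation plus, for i > 0,
    a lateral projection of the previous source activation.
    """
    h_src, h_tgt = x, x
    for i, (src, tgt) in enumerate(zip(source_hidden, target_hidden)):
        pre = affine(tgt, h_tgt)
        if i > 0:
            lateral = affine(laterals[i - 1], h_src)
            pre = [p + l for p, l in zip(pre, lateral)]
        h_src = relu(affine(src, h_src))  # source column is only read, never updated
        h_tgt = relu(pre)
    return affine(output_layer, h_tgt)


rng = random.Random(0)
# Hypothetical sizes: 4 input features, two hidden layers of 5 units, 3 target classes.
source_hidden = [init_layer(4, 5, rng), init_layer(5, 5, rng)]

# PT/FT: the target network starts from the source hidden layers.
hidden, new_output = pretrain_finetune_init(source_hidden, 3, rng)

# Prognet: a new target column plus one lateral connection into its second layer.
target_hidden = [init_layer(4, 5, rng), init_layer(5, 5, rng)]
laterals = [init_layer(5, 5, rng)]
output_layer = init_layer(5, 3, rng)
logits = prognet_forward([0.1, 0.2, 0.3, 0.4], source_hidden, target_hidden,
                         laterals, output_layer)
```

In this sketch the PT/FT copy would subsequently be fine-tuned on target-language data, which can overwrite what was learned from the source language, whereas the Prognet source column stays untouched and only the target column, lateral connections, and output layer would be trained.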

Bibliographic details

Source
《自动化学报：英文版》 (Acta Automatica Sinica, English Edition) | 2019, Issue 5 | pp. 1187-1195 | 9 pages
Authors
Danyang Liu; Ji Xu; Pengyuan Zhang; Yonghong Yan
Affiliations

[1] Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China

[2] School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China

[3] Xinjiang Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China

Original format: PDF
Text language: chi
Chinese Library Classification: Automation technology; computer technology
Keywords
Bottleneck feature (BNF); cross-lingual automatic speech recognition (ASR); progressive neural networks (Prognets) model; transfer learning
