首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-Training and its Application to Children’s ASR
【24h】

Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-Training and its Application to Children’s ASR

机译:BI-APC:用于无监督的预测预测编码的双向自动评级预测编码及其在儿童ASR中的应用

获取原文
获取外文期刊封面目录资料

摘要

We present a bidirectional unsupervised model pre-training (UPT) method and apply it to children’s automatic speech recognition (ASR). An obstacle to improving child ASR is the scarcity of child speech databases. A common approach to alleviate this problem is model pre-training using data from adult speech. Pre-training can be done using supervised (SPT) or unsupervised methods, depending on the availability of annotations. Typically, SPT performs better. In this paper, we focus on UPT to address the situations when pre-training data are unlabeled. Autoregressive predictive coding (APC), a UPT method, predicts frames from only one direction, limiting its use to uni-directional pre-training. Conventional bidirectional UPT methods, however, predict only a small portion of frames. To extend the benefits of APC to bi-directional pre-training, Bi-APC is proposed. We then use adaptation techniques to transfer knowledge learned from adult speech (using the Librispeech corpus) to child speech (OGI Kids corpus). LSTM-based hybrid systems are investigated. For the uni-LSTM structure, APC obtains similar WER improvements to SPT over the baseline. When applied to BLSTM, however, APC is not as competitive as SPT, but our proposed Bi-APC has comparable improvements to SPT.
机译:我们提出了一个双向无人监督的模型预培训(UPT)方法,并将其应用于儿童的自动语音识别(ASR)。改善儿童ASR的障碍是儿童语音数据库的稀缺。缓解此问题的常见方法是使用来自成人语音的数据的模型预训练。根据注释的可用性,可以使用监督(SPT)或无监督的方法进行预培训。通常,SPT执行更好。在本文中,我们专注于UPT解决预训练数据未标记的情况。自回归预测编码(APC),UPT方法,从一个方向预测帧,限制其对单向预训练的用途。然而,传统的双向UPT方法仅预测框架的一小部分。为了扩展APC对双向预训练的益处,提出了BI-APC。然后,我们使用适应技巧从成年语音(使用LibrisPeech Corpus)到儿童语音(OGI Kids语料库)来传输知识。研究了基于LSTM的混合系统。对于UNI-LSTM结构,APC在基线上获得类似的WER改进。然而,当施加到BLSTM时,APC与SPT不那样竞争,但我们所提出的BI-APC对SPT具有可比性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号