首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS
【24h】

ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS

机译:具有深神经网络的低资源语音识别的弹性光谱失真

获取原文

摘要

An acoustic model based on hidden Markov models with deep neural networks (DNN-HMM) has recently been proposed and achieved high recognition accuracy. In this paper, we investigated an elastic spectral distortion method to artificially augment training samples to help DNN-HMMs acquire enough robustness even when there are a limited number of training samples. We investigated three distortion methods-vocal tract length distortion, speech rate distortion, and frequency-axis random distortion-and evaluated those methods with Japanese lecture recordings. In a large vocabulary continuous speech recognition task with only 10 hours of training samples, a DNN-HMM trained with the elastic spectral distortion method achieved a 10.1% relative word error reduction compared with a normally trained DNN-HMM.
机译:最近提出了一种基于隐马尔可夫模型的声学模型,最近提出了高度神经网络(DNN-HMM)并实现了高识别准确性。在本文中,我们研究了一种弹性光谱失真方法,以便人工增强训练样本,以帮助DNN-HMMS即使存在有限数量的训练样本而获得足够的稳健性。我们研究了三个失真方法 - 声带长度失真,语音失真和频率轴随机失真 - 并评估了日本讲义记录的这些方法。在仅具有10小时的训练样本的大型词汇连续语音识别任务中,用弹性光谱失真方法训练的DNN-HMM实现了10.1%的相对字误差减少,与通常训练的DNN-HMM相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号