首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS

【24h】

ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS

机译：具有深神经网络的低资源语音识别的弹性光谱失真

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An acoustic model based on hidden Markov models with deep neural networks (DNN-HMM) has recently been proposed and achieved high recognition accuracy. In this paper, we investigated an elastic spectral distortion method to artificially augment training samples to help DNN-HMMs acquire enough robustness even when there are a limited number of training samples. We investigated three distortion methods-vocal tract length distortion, speech rate distortion, and frequency-axis random distortion-and evaluated those methods with Japanese lecture recordings. In a large vocabulary continuous speech recognition task with only 10 hours of training samples, a DNN-HMM trained with the elastic spectral distortion method achieved a 10.1% relative word error reduction compared with a normally trained DNN-HMM.

机译：最近提出了一种基于隐马尔可夫模型的声学模型，最近提出了高度神经网络（DNN-HMM）并实现了高识别准确性。在本文中，我们研究了一种弹性光谱失真方法，以便人工增强训练样本，以帮助DNN-HMMS即使存在有限数量的训练样本而获得足够的稳健性。我们研究了三个失真方法 - 声带长度失真，语音失真和频率轴随机失真 - 并评估了日本讲义记录的这些方法。在仅具有10小时的训练样本的大型词汇连续语音识别任务中，用弹性光谱失真方法训练的DNN-HMM实现了10.1％的相对字误差减少，与通常训练的DNN-HMM相比。

著录项

来源
《Workshop on Automatic Speech Recognition and Understanding 》|2013年||共6页
会议地点
作者
Naoyuki Kanda; Ryu Takeda; Yasunari Obuchi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
Deep neural network; speech recognition; elastic distortion;

机译：深神经网络;语音识别;弹性扭曲;

相似文献

外文文献
中文文献
专利

1. Multilingual Convolutional, Long Short-Term Memory, Deep Neural Networks for Low Resource Speech Recognition [J] . Danish bukhari, Yutian Wang, Hui Wang Procedia Computer Science . 2017 ,第1期

机译：多语言卷积，长短期记忆，用于低资源语音识别的深度神经网络
2. Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition [J] . Chen Dongpeng, Mak Brian Kan-Wing Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2015 ,第7期

机译：深度神经网络的多任务学习，用于低资源语音识别
3. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu Journal of Signal and Information Processing . 2020 ,第1期

机译：Chaha非常低于资源语言的多语言深神经网络建模方法对自动语音识别系统的研究
4. Elastic spectral distortion for low resource speech recognition with deep neural networks [C] . Kanda Naoyuki, Takeda Ryu, Obuchi Yasunari IEEE Workshop on Automatic Speech Recognition and Understanding . 2013

机译：弹性频谱失真用于深度神经网络的低资源语音识别
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Deep neural network features and semi-supervised training for low resource speech recognition [O] . Samuel Thomas, Michael L. Seltzer, Kenneth Church, 2013

机译：低资源语音识别的深度神经网络特征和半监督训练

ELASTIC SPECTRAL DISTORTION FOR LOW RESOURCE SPEECH RECOGNITION WITH DEEP NEURAL NETWORKS

摘要

著录项

相似文献

相关主题

期刊订阅