Single-channel mixed speech recognition using deep neural networks

Abstract

In this work, we study the problem of single-channel mixed speech recognition using deep neural networks (DNNs). Using a multi-style training strategy on artificially mixed speech data, we investigate several training setups that enable the DNN to generalize to similar patterns in the test data. We also introduce a WFST-based two-talker decoder to work with the trained DNNs. Experiments on the 2006 speech separation and recognition challenge task demonstrate that the proposed DNN-based system is remarkably robust to the interference of a competing speaker. The best setup of our proposed systems achieves an overall WER of 19.7%, which improves upon the results obtained by the state-of-the-art IBM superhuman system by 1.9% absolute, with fewer assumptions and lower computational complexity.
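The abstract does not say how the artificially mixed training data is produced. As a rough illustration only, the sketch below scales a competing (masker) signal so that the target-to-masker energy ratio matches a chosen value in dB, then adds it to the target. The function names `energy` and `mix_at_tmr`, the truncation to the shorter signal, and the choice of an energy-ratio criterion are all assumptions made for this example, not details taken from the paper.

```python
import math


def energy(signal):
    """Return the sum of squared samples of a sequence of floats."""
    return sum(x * x for x in signal)


def mix_at_tmr(target, masker, tmr_db):
    """Mix two signals at a given target-to-masker ratio (hypothetical helper).

    Returns (mixture, scaled_masker). Both inputs are truncated to the
    shorter length, which is an assumption of this sketch.
    """
    n = min(len(target), len(masker))
    target, masker = target[:n], masker[:n]
    e_t, e_m = energy(target), energy(masker)
    if e_t == 0 or e_m == 0:
        raise ValueError("both signals must have non-zero energy")
    # Choose gain g so that 10 * log10(e_t / (g**2 * e_m)) == tmr_db.
    gain = math.sqrt(e_t / (e_m * 10 ** (tmr_db / 10)))
    scaled = [gain * m for m in masker]
    mixture = [t + s for t, s in zip(target, scaled)]
    return mixture, scaled


# Usage: two synthetic tones stand in for two talkers.
target = [math.sin(0.05 * i) for i in range(1000)]
masker = [0.3 * math.sin(0.11 * i) for i in range(1200)]
mixture, scaled = mix_at_tmr(target, masker, tmr_db=-3.0)
achieved_db = 10 * math.log10(energy(target) / energy(scaled))
```

In a multi-style setup one would presumably repeat this over many utterance pairs and several ratios, so the network sees a range of mixing conditions during training.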

Bibliographic information

Source
IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, pp. 5632-5636 (5 pages)
Authors
Weng Chao; Yu Dong; Seltzer Michael L.; Droppo Jasha
Original format: PDF
Keywords
DNN; WFST; multi-talker ASR

Similar literature

1. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J]. Yan-Hui Tu, Jun Du, Chin-Hui Lee. Journal of Signal Processing Systems for Signal, Image, and Video Technology. 2018, No. 7
2. Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition [J]. Weng Chao, Yu Dong, Seltzer Michael L. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2015, No. 10
3. A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks [J]. Jun Du, Yanhui Tu, Li-Rong Dai. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2016, No. 8
4. Single-Channel Mixed Speech Recognition Using Deep Neural Networks [C]. Chao Weng, Dong Yu, Michael L. Seltzer. IEEE International Conference on Acoustics, Speech and Signal Processing. 2014
5. Dysarthric Speech Recognition and Offline Handwriting Recognition Using Deep Neural Networks [D]. Pillai, Suhas Balkrishna. 2017
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O]. Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez. 2012
7. Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition [O]. Aditya Arie Nugraha, Kazumasa Yamamoto, Seiichi Nakagawa. 2014
