Lexicon-Free Conversational Speech Recognition with Neural Networks

机译：神经网络的无词典对话语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present an approach to speech recognition that uses only a neural network to map acoustic input to characters, a character-level language model, and a beam search decoding procedure. This approach eliminates much of the complex infrastructure of modern speech recognition systems, making it possible to directly train a speech recognizer using errors generated by spoken language understanding tasks. The system naturally handles out of vocabulary words and spoken word fragments. We demonstrate our approach using the challenging Switchboard telephone conversation transcription task, achieving a word error rate competitive with existing baseline systems. To our knowledge, this is the first entirely neural-network-based system to achieve strong speech transcription results on a conversational speech task. We analyze qualitative differences between transcriptions produced by our lexicon-free approach and transcriptions produced by a standard speech recognition system. Finally, we evaluate the impact of large context neural network character language models as compared to standard n-gram models within our framework.

机译：我们提出了一种语音识别方法，该方法仅使用神经网络将声学输入映射到字符，字符级语言模型和波束搜索解码过程。这种方法消除了现代语音识别系统的许多复杂基础结构，从而可以使用口语理解任务所产生的错误直接训练语音识别器。该系统自然可以处理词汇单词和口语单词片段。我们使用具有挑战性的总机电话对话转录任务演示了我们的方法，实现了与现有基准系统相媲美的单词错误率。据我们所知，这是第一个完全基于神经网络的系统，可以在会话语音任务中实现强大的语音转录结果。我们分析了无词典方法产生的转录与标准语音识别系统产生的转录之间的质量差异。最后，我们评估了与我们框架内的标准n-gram模型相比，大上下文神经网络字符语言模型的影响。

著录项

来源
《Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 》|2015年|345-354|共10页
会议地点
作者
Andrew L. Maas; Ziang Xie; Dan Jurafsky; Andrew Y. Ng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Recent advances in conversational speech recognition using convolutional and recurrent neural networks [J] . G. Saon, M. Picheny IBM Journal of Research and Development . 2017 ,第4期

机译：使用卷积和递归神经网络进行对话语音识别的最新进展
2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018 ,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
3. Recognition of words from brain-generated signals of speech-impaired people: Application of autoencoders as a neural Turing machine controller in deep neural networks [J] . Boloukian Behzad, Safi-Esfahani Faramarz Neural Networks: The Official Journal of the International Neural Network Society . 2020 ,第期

机译：识别语音障碍的脑生成信号的单词：AutoEncoders在深神经网络中的神经图定型机控制器中的应用
4. Lexicon-Free Conversational Speech Recognition with Neural Networks [C] . Andrew L. Maas, Ziang Xie, Dan Jurafsky, Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2015

机译：与神经网络的词典无序会话语音识别
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. TRAPping conversational speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition [O] . Nelson Morgan, Barry Y. Chen, Qifeng Zhu, 2004

机译：捕获对话语音：将TRAP / Tandem方法扩展到会话电话语音识别

Lexicon-Free Conversational Speech Recognition with Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅