End-to-End Speech Recognition in Russian

机译：俄语端到端语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

End-to-end speech recognition systems incorporating deep neural networks (DNNs) have achieved good results. We propose applying CTC (Connectionist Temporal Classification) models and attention-based encoder-decoder in automatic recognition of the Russian continuous speech. We used different neural network models such Long short-term memory (LSTM), bidirectional LSTM and Residual Networks to provide experiments. We got recognition accuracy a bit worse than hybrid models but our models can work without large language model and they showed better performance in terms of average decoding speed that can be helpful in real systems. Experiments are performed with extra-large vocabulary (more than 150K words) of Russian speech.

机译：结合深度神经网络（DNN）的端到端语音识别系统取得了良好的效果。我们建议在自动识别俄语连续语音中应用CTC（连接器时间分类）模型和基于注意力的编解码器。我们使用了不同的神经网络模型，例如长短期记忆（LSTM），双向LSTM和残差网络来提供实验。我们的识别精度比混合模型差一点，但是我们的模型可以在没有大型语言模型的情况下工作，并且在平均解码速度方面表现出更好的性能，这对实际系统很有帮助。实验使用超大词汇（超过15万个单词）的俄语语音进行。

著录项

来源
《International Conference on speech and computer》|2018年|377-386|共10页
会议地点
作者
Nikita Markovnikov; Irina Kipyatkova; Elena Lyakso;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
End-to-end models; Deep learning; Russian speech; Speech recognition;

机译：端到端模型;深度学习;俄语演讲;语音识别;

相似文献

外文文献
中文文献
专利

1. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J] . Odette Scharenborg, Louis ten Bosch, Lou Boves, The Journal of the Acoustical Society of America . 2003,第6期

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型（L）
2. Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech [J] . Benjamin Sertolli, Zhao Ren, Bjoern W. Schuller, Computer speech and language . 2021,第Jula期

机译：从言语中，从深端到端语音识别网络中的代表转移学习
3. An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition [J] . Bo Wu, Kehuang Li, Fengpei Ge, Selected Topics in Signal Processing, IEEE Journal of . 2017,第8期

机译：端到端深度学习方法可同时进行语音去混响和声学建模，以实现可靠的语音识别
4. Investigating Joint CTC-Attention Models for End-to-End Russian Speech Recognition [C] . Nikita Markovnikov, Irina Kipyatkova International Conference on Speech and Computer . 2019

机译：研究用于端到端俄语语音识别的联合CTC注意模型
5. End-to-End Speech Recognition on Conversations [D] . Kim, Suyoun . 2019

机译：对话的端到端语音识别
6. Dynamic Acoustic Unit Augmentation with BPE-Dropout for Low-Resource End-to-End Speech Recognition [O] . Aleksandr Laptev, Andrei Andrusenko, Ivan Podluzhny, 2021

机译：用BPE-ropout进行动态声学单元增强用于低资源端到端语音识别
7. Bridging Automatic Speech Recognition and Psycholinguistics: Extending Shortlist to an End-to-End Model of Human Speech Recognition [O] . Scharenborg O.E., Bosch L.F.M. ten, Boves L.W.J., 2003

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型

End-to-End Speech Recognition in Russian

摘要

著录项

相似文献

相关主题

期刊订阅