首页> 外文会议>International Conference on speech and computer >End-to-End Speech Recognition in Russian
【24h】

End-to-End Speech Recognition in Russian

机译:俄语端到端语音识别

获取原文

摘要

End-to-end speech recognition systems incorporating deep neural networks (DNNs) have achieved good results. We propose applying CTC (Connectionist Temporal Classification) models and attention-based encoder-decoder in automatic recognition of the Russian continuous speech. We used different neural network models such Long short-term memory (LSTM), bidirectional LSTM and Residual Networks to provide experiments. We got recognition accuracy a bit worse than hybrid models but our models can work without large language model and they showed better performance in terms of average decoding speed that can be helpful in real systems. Experiments are performed with extra-large vocabulary (more than 150K words) of Russian speech.
机译:结合深度神经网络(DNN)的端到端语音识别系统取得了良好的效果。我们建议在自动识别俄语连续语音中应用CTC(连接器时间分类)模型和基于注意力的编解码器。我们使用了不同的神经网络模型,例如长短期记忆(LSTM),双向LSTM和残差网络来提供实验。我们的识别精度比混合模型差一点,但是我们的模型可以在没有大型语言模型的情况下工作,并且在平均解码速度方面表现出更好的性能,这对实际系统很有帮助。实验使用超大词汇(超过15万个单词)的俄语语音进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号