【24h】

Deep Learning Based Bangla Speech-to-Text Conversion

机译:基于深度学习的Bangla语音到文本转换

获取原文
获取原文并翻译 | 示例

摘要

Speech-To-Text conversion is the process of recognizing speech in audio and producing a text transcript for it. Due to speech being such an intuitive medium of communication, this technology can have far reaching effects in easing the interaction between humans and machine. This paper presents a complete speech-to-text conversion system for the Bangla language (also known as Bengali) using Deep Recurrent Neural Networks. Possible optimization such as Broken Language Format has been proposed which is based on properties of the Bangla Language for reducing the training time of the network. A simple deep recurrent neural network architecture has been used for speech recognition. It was trained with collected data and which yielded over 95% accuracy in case of training data and 50% accuracy in case of testing data.
机译:语音到文本转换是识别音频中的语音并为其生成文本记录的过程。由于语音是一种直观的通信媒介,因此该技术在简化人机交互方面具有深远的影响。本文介绍了一种使用深度递归神经网络的孟加拉语(也称为孟加拉语)语言的完整语音到文本转换系统。已经提出了可能的优化方法,例如“折断的语言格式”,该方法基于孟加拉语言的属性来减少网络的训练时间。一种简单的深度递归神经网络架构已用于语音识别。它使用收集的数据进行了训练,在训练数据的情况下,其准确性超过95%,在测试数据的情况下,其准确性达到50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号