首页> 外文会议>立体映像技術研究会;オーディオビジュアル複合情報処理研究会;スマートインフォメディアシステム研究会 >An experimental comparison of CNN- and CRNN-CTC for automatic phrase speech recognition systems using a children's speech database
【24h】

An experimental comparison of CNN- and CRNN-CTC for automatic phrase speech recognition systems using a children's speech database

机译:使用儿童语音数据库的自动短语语音识别系统的CNN和CRNN-CTC的实验比较

获取原文

摘要

Children's speech recognition is still a challenging issue. In the case of children's speeches, the accuracy of conventional phrase speech recognition approaches is significantly low. This is mainly owing to the high variability of pronunciation patterns due to children's physical activity. Motivated by this, in this paper, we present a phrase speech recognition system using neural networks. We use a convolutional neural network (CNNs) and its recurrent neural network (RNN) version, say CRNN. Also, both approaches utilize a connectionist temporal classification (CTC) loss function, which allows networks to be trained without any prior alignment. Through experiments using a children's speech database, we show the comparison results of CNN- and CRNN-CTC approaches.
机译:儿童的语音识别仍然是一个具有挑战性的问题。在儿童语音的情况下,常规短语语音识别方法的准确性非常低。这主要是由于儿童的身体活动导致发音模式的高度可变性所致。为此,在本文中,我们提出了一种使用神经网络的短语语音识别系统。我们使用卷积神经网络(CNN)及其递归神经网络(RNN)版本,例如CRNN。而且,这两种方法都利用了连接主义的时间分类(CTC)损失函数,该函数允许无需任何事先对准即可对网络进行训练。通过使用儿童语音数据库的实验,我们显示了CNN和CRNN-CTC方法的比较结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号