首页> 外文会议>日本音響学会研究発表会 >Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection
【24h】

Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection

机译:使用跳过和残差连接使用扩张 - 卷积神经网络的语音口音和性别识别

获取原文
获取外文期刊封面目录资料

摘要

This paper reports our speech accent and genderrecognition system for the Vietnamese language.Prior studies have shown that the temporal structureof speech also contains significant cues for speechaccent and gender. However, conventional CNNcannot have large filter size as it increases thenetwork complexity. Inspired by the success ofWaveNet, we propose using the dilatedconvolutional neural network (dilated-CNN) withskip- and residual-connection to better capture thespeech temporal structure. The experiment resultsshow that our proposed architecture achieves higherperformance compared to non-dilated CNN.
机译:本文报告了我们的言语重音和性别越南语的识别系统。事先研究表明时间结构讲话也包含了言语的重要提示口音和性别。但是,常规CNN不能具有大的过滤器尺寸随着它的增加网络复杂性。灵感来自成功Wavenet,我们建议使用扩张卷积神经网络(扩张-CNN)与跳过和剩余连接以更好地捕获语音时间结构。实验结果表明我们拟议的建筑达到更高与非扩张CNN相比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号