首页> 外文会议>日本音響学会研究発表会 >Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection

【24h】

Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection

机译：使用跳过和残差连接使用扩张 - 卷积神经网络的语音口音和性别识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper reports our speech accent and genderrecognition system for the Vietnamese language.Prior studies have shown that the temporal structureof speech also contains significant cues for speechaccent and gender. However, conventional CNNcannot have large filter size as it increases thenetwork complexity. Inspired by the success ofWaveNet, we propose using the dilatedconvolutional neural network (dilated-CNN) withskip- and residual-connection to better capture thespeech temporal structure. The experiment resultsshow that our proposed architecture achieves higherperformance compared to non-dilated CNN.

机译：本文报告了我们的言语重音和性别越南语的识别系统。事先研究表明时间结构讲话也包含了言语的重要提示口音和性别。但是，常规CNN不能具有大的过滤器尺寸随着它的增加网络复杂性。灵感来自成功Wavenet，我们建议使用扩张卷积神经网络（扩张-CNN）与跳过和剩余连接以更好地捕获语音时间结构。实验结果表明我们拟议的建筑达到更高与非扩张CNN相比的性能。

著录项

来源
《日本音響学会研究発表会》|2019年|xlviii 143 p.|共5页
会议地点
作者
Tuan Vu Ho; Masato Akagi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类声学工程;
关键词

相似文献

外文文献
中文文献
专利

1. Speech Emotion Recognition based on Multi-Level Residual Convolutional Neural Networks [J] . Kai Zheng, ZhiGuang Xia, Yi Zhang, Engineering Letters . 2020,第2期

机译：基于多级残余卷积神经网络的语音情感识别
2. Residual connection-based graph convolutional neural networks for gait recognition [J] . Shopon Md, Bari A. S. M. Hossain, Gavrilova Marina L. The Visual Computer . 2021,第9a11期

机译：基于残余连接的图形卷积神经网络进行步态识别
3. Attention-Based Convolution Skip Bidirectional Long Short-Term Memory Network for Speech Emotion Recognition [J] . Huiyun Zhang, Heming Huang, Henry Han Quality Control, Transactions . 2021,第1期

机译：基于注意力的卷积跳过双向长期短期记忆网络，用于语音情感识别
4. Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection [C] . Tuan Vu Ho, Masato Akagi 日本音響学会2019年春季研究発表会講演論文集 . 2019

机译：带有跳过和残差连接的扩散卷积神经网络的语音口音和性别识别
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition. [D] . Belilovsky, Eugene. 2011

机译：用于与说话人无关的语音识别的卷积神经网络。
6. A Neural Network with Convolutional Module and Residual Structure for Radar Target Recognition Based on High-Resolution Range Profile [O] . Zhequan Fu, Shangsheng Li, Xiangping Li, 2020

机译：基于卷积模块和残差结构的神经网络基于高分辨率距离剖面的雷达目标识别
7. Convolutional Neural Networks Using Skip Connections with Layer Groups for Super-Resolution Image Reconstruction Based on Deep Learning [O] . Hyeongyeom Ahn, Changhoon Yim 2020

机译：基于深度学习的超分辨率图像重建，卷积神经网络使用与层组进行跳过连接

Speech Accent and Gender Recognition using Dilated- Convolution Neural Network with Skip and Residual Connection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅