End-to-End Mandarin Recognition based on Convolution Input

Yanzhe Wang; LiMin Zhang; Bingqiang Zhang; Zhenyu Li

Abstract

Mainstream neural network training uses the cross-entropy criterion, which classifies and optimizes each frame of acoustic data, whereas continuous speech recognition is evaluated by sequence-level transcription accuracy. To address this mismatch, this paper constructs an end-to-end speech recognition system based on sequence-level transcription. The model processes the input features with a convolutional neural network, selects the best-performing network structure, and performs two-dimensional convolution over the time and frequency domains. The network also uses batch normalization to reduce generalization error and speed up training. Finally, the hyper-parameters of the decoding process are optimized to improve modelling performance. Experimental results show that the system performs substantially better than mainstream speech recognition systems.
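
The abstract gives no layer sizes or implementation details, so the following is only a minimal NumPy sketch of the two ideas it names: a two-dimensional convolution that slides over both the time and frequency axes of the input features, followed by batch normalization. The batch shape (4 utterances, 100 frames, 40 frequency bins), the 11 x 5 kernel, and the helper names `conv2d_valid` and `batch_norm` are illustrative assumptions, not values or code from the paper.

```python
import numpy as np


def conv2d_valid(x, kernel):
    """'Valid' 2D cross-correlation of one (time, freq) feature map with one kernel."""
    kt, kf = kernel.shape
    t_out = x.shape[0] - kt + 1
    f_out = x.shape[1] - kf + 1
    out = np.empty((t_out, f_out))
    for t in range(t_out):
        for f in range(f_out):
            # Each output value mixes neighbouring frames AND neighbouring bins.
            out[t, f] = np.sum(x[t:t + kt, f:f + kf] * kernel)
    return out


def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Training-mode batch normalization for a single channel.

    Statistics are taken over the batch, time and frequency axes together,
    as is usual for one channel of a convolutional layer.
    """
    mean = batch.mean()
    var = batch.var()
    return gamma * (batch - mean) / np.sqrt(var + eps) + beta


rng = np.random.default_rng(0)

# Hypothetical input: 4 utterances, 100 frames, 40 frequency bins (assumed sizes).
features = rng.normal(size=(4, 100, 40))

# Hypothetical kernel spanning 11 frames and 5 frequency bins (assumed size).
kernel = rng.normal(size=(11, 5))

conv_out = np.stack([conv2d_valid(utt, kernel) for utt in features])
normed = batch_norm(conv_out)

print(conv_out.shape)  # (4, 90, 36)
```

With a "valid" convolution the output shrinks by the kernel size minus one along each axis, which is why 100 frames become 90 and 40 bins become 36. After normalization the activations have approximately zero mean and unit variance across the batch; keeping layer inputs in that range is the mechanism usually credited with faster training.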

Bibliographic information

Source: MATEC Web of Conferences, 2018, Issue 1, 5 pages
Authors: Yanzhe Wang; LiMin Zhang; Bingqiang Zhang; Zhenyu Li
Original format: PDF
CLC classification: General industrial technology

