首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Mandarin speech recognition using convolution neural network with augmented tone features
【24h】

Mandarin speech recognition using convolution neural network with augmented tone features

机译:卷积神经网络增强语音特征的普通话语音识别

获取原文

摘要

Due to its ability of reducing spectral variations and modeling spectral correlations existed in speech signals, the convolutional neural network (CNN) has been shown effective in modeling speech compared to deep neural network (DNN). In this study, we explore applying CNN to Mandarin speech recognitions. Besides exploring appropriate CNN architecture for recognition performance, focuses are on investigating the effective acoustic features, and effectivenesses of applying tonal information which have been verified helpful in other types of acoustic models to the acoustic features in the CNN. We conduct speech recognition experiments on Mandarin broadcast speech recognition to test the effectivenesses of the proposed approaches. The CNN shows its clear superiority to the DNN, with relative reductions of character error rate (CER) among 7.7–13.1% for broadcast news speech (BN), and 5.4–9.9% for broadcast conversation speech (BC). Like in the Gaussian Mixture Model (GMM) and DNN systems, the tonal information characterized by the fundamental frequency (F) and fundamental frequency variations (FFV) are found still helpful in CNN models, they achieve relative CER reductions over 6.7% for BN and 4.3% for BC respectively when compared with the baseline Mel-filter bank feature.
机译:由于卷积神经网络(CNN)可以减少语音信号中的频谱变化并对频谱相关性进行建模,因此与深层神经网络(DNN)相比,已显示出对语音建模的有效效果。在这项研究中,我们探索将CNN应用于普通话语音识别。除了探索适当的CNN架构以提高识别性能外,重点还在于调查有效的声学特征以及将已证明在其他类型的声学模型中有用的音调信息应用于CNN的声学特征的有效性。我们对普通话广播语音识别进行语音识别实验,以测试所提出方法的有效性。 CNN表现出明显优于DNN的优势,广播新闻语音(BN)的字符错误率(CER)相对降低了7.7–13.1%,广播对话语音(BC)的字符错误率(CER)降低了5.4–9.9%。像在高斯混合模型(GMM)和DNN系统中一样,以基频(F)和基频变化(FFV)为特征的音调信息在CNN模型中仍然很有帮助,对于BN和CN而言,它们的CER相对降低了6.7%以上。与基线Mel-filter bank功能相比,BC分别为4.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号