
Adaptive recognition of different accents conversations based on convolutional neural network


Abstract

This paper proposes an adaptive method, based on a Convolutional Neural Network (CNN), for recognizing conversations that contain different accents. It targets dialogue speech recognition in call-center environments where the speakers have different accents. For the first time, Mel-Frequency Cepstral Coefficient (MFCC) features and spectrogram features are combined as the input of the CNN to train the speakers' voice feature model and to estimate the change point. An accent classification method based on weighted fusion features is then proposed, and the IFLY voice recognition system is introduced to build recognition models for dialogues with different accents on top of speaker segmentation. The experiments use a real database of dialogue speech from the insurance sales and real estate sales industries as the dataset. Comparative experiments show that the word error rate of speech recognition after speaker segmentation and accent classification was reduced by 20% compared with the word error rate of the original speech recognition.
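
The abstract reports its result as a reduction in word error rate (WER) but does not say whether the 20% is a relative or an absolute reduction. As a minimal sketch of the standard WER definition (word-level edit distance divided by the number of reference words), the following may help when reading that figure. The function names and the example sentences are hypothetical illustrations, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction; an absolute reduction would be baseline - new."""
    return (baseline_wer - new_wer) / baseline_wer


# Made-up sentences, for illustration only.
reference = "the policy covers fire and theft"
baseline = word_error_rate(reference, "the policy cover fire in theft")
improved = word_error_rate(reference, "the policy covers fire in theft")
print(baseline, improved, relative_reduction(baseline, improved))
```

Under the relative reading, a baseline WER of 0.50 falling to 0.40 counts as a 20% reduction. Under the absolute reading, the same baseline would have to fall to 0.30.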

Bibliographic record

  • Source
    Multimedia Tools and Applications | 2019, Issue 21 | pp. 30749-30767 | 19 pages
  • Authors

    Zhong Jiang; Zhang Pan; Li Xue;

  • Author affiliations

    Chongqing Univ Coll Comp Sci Chongqing 400030 Peoples R China|Chongqing Univ Key Lab Dependable Serv Comp Cyber Phys Soc Minist Educ Chongqing 400030 Peoples R China;

    Chongqing Univ Coll Comp Sci Chongqing 400030 Peoples R China|China United Network Commun Co Ltd Xian Branch Xian 710065 Shaanxi Peoples R China;

    Univ Queensland Sch Informat Technol & Elect Engn Brisbane Qld Australia;

  • Indexing information
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification
  • Keywords

    Combined feature; Speaker segmentation; Accent classification; Speech recognition;

