首页> 外文期刊>Computational Intelligence >Taylor-AMS features and deep convolutional neural network for converting nonaudible murmur to normal speech
【24h】

Taylor-AMS features and deep convolutional neural network for converting nonaudible murmur to normal speech

机译:泰勒-AMS特征和深度卷积神经网络,用于将非Authautible Murmur转换为正常演讲

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Communication becomes effective when the speech signal arrives with the profound characteristics. This insisted the researchers to develop an automatic system of recognizing the speech signals from the murmurs. Some of the traditional automatic recognition systems are unfit for the silent environments imposing a need for an effective recognition system. Also, the traditional automatic recognition methods, like Neural Networks, render poor performance in the presence of the murmurs. Thus, this article proposes a method for automatic whisper recognition using the Deep Convolutional Neural Network (DCNN). The training of the DCNN is performed using the proposed Stochastic-Whale Optimization Algorithm (Stochastic-WOA), which is designed by the integration of Stochastic Gradient Descent algorithm with WOA. The input to the classifier is the features that include pitch chroma, spectral centroid, spectral skewness, and Taylor-Amplitude Modulation Spectrogram (Taylor-AMS), which is obtained by combining Taylor series and Amplitude Modulation Spectrogram (AMS) features, of the preprocessed input speech signal. The experimentation of the method is performed using the real database and the analysis proves that the proposed method acquired a maximal accuracy of 0.9723, minimal False Positive Rate of 0.0257, and maximal True Positive Rate of 0.9981, respectively.
机译:当语音信号以深刻特征到达时,通信变得有效。这使研究人员能够开发一种识别来自杂音的语音信号的自动系统。一些传统的自动识别系统对于静默环境强加有效识别系统的静音环境不合适。此外,传统的自动识别方法,如神经网络,在杂音的存在下表现不佳。因此,本文提出了一种使用深卷积神经网络(DCNN)自动耳语识别的方法。使用所提出的随机鲸鲸优化算法(随机WOA)进行DCNN的训练,其是通过与WOA的随机梯度下降算法的集成而设计的。分类器的输入是通过将预处理的泰勒序列和幅度调制谱图(AMS)特征组合来获得的节奏色谱,光谱质心,光谱偏振和泰勒幅度调制谱图(Taylor-AMS)的特征。输入语音信号。该方法的实验是使用真实数据库进行的,分析证明,所提出的方法获得了0.9723的最大精度,最小的假阳性率为0.0257,并分别为0.9981的最大真实阳性率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号