...
首页> 外文期刊>Expert Systems with Application >A new optimum feature extraction and classification method for speaker recognition: GWPNN
【24h】

A new optimum feature extraction and classification method for speaker recognition: GWPNN

机译:一种新的说话人识别最佳特征提取与分类方法:GWPNN

获取原文
获取原文并翻译 | 示例
           

摘要

Speech and speaker recognition is an important topic to be performed by a computer system. In this paper, an expert speaker recognition system based on optimum wavelet packet entropy is proposed for speaker recognition by using real speech/voice signal. This study contains both the combination of the new feature extraction and classification approach by using optimum wavelet packet entropy parameter values. These optimum wavelet packet entropy values are obtained from measured real English language speech/voice signal waveforms using speech experimental set. A genetic-wavelet packet-neural network (GWPNN) model is developed in this study. GWPNN includes three layers which are genetic algorithm, wavelet packet and multi-layer perception. The genetic algorithm layer of GWPNN is used for selecting the feature extraction method and obtaining the optimum wavelet entropy parameter values. In this study, one of the four different feature extraction methods is selected by using genetic algorithm. Alternative feature extraction methods are wavelet packet decomposition, wavelet packet decomposition - short-time Fourier transform, wavelet packet decomposition - Born-Jordan time-frequency representation, wavelet packet decomposition - Choi-Williams time-frequency representation. The wavelet packet layer is used for optimum feature extraction in the time-frequency domain and is composed of wavelet packet decomposition and wavelet packet entropies. The multi-layer perceptron of GWPNN, which is a feed-forward neural network, is used for evaluating the fitness function of the genetic algorithm and for classification speakers. The performance of the developed system has been evaluated by using noisy English speech/voice signals. The test results showed that this system was effective in detecting real speech signals. The correct classification rate was about 85% for speaker classification.
机译:语音和说话者识别是计算机系统要执行的重要主题。本文提出了一种基于最优小波包熵的专家说话人识别系统,用于基于真实语音/语音信号的说话人识别。这项研究通过使用最佳小波包熵参数值,将新特征提取和分类方法结合在一起。这些最佳的小波包熵值是使用语音实验集从实测英语语音/语音信号波形中获得的。本研究建立了遗传小波包神经网络(GWPNN)模型。 GWPNN包括遗传算法,小波包和多层感知三层。 GWPNN的遗传算法层用于选择特征提取方法并获得最佳小波熵参数值。在这项研究中,使用遗传算法选择了四种不同的特征提取方法之一。可选的特征提取方法是小波包分解,小波包分解-短时傅立叶变换,小波包分解-Born-Jordan时频表示,小波包分解-Choi-Williams时频表示。小波包层用于时频域的最优特征提取,由小波包分解和小波包熵组成。 GWPNN的多层感知器是一种前馈神经网络,用于评估遗传算法的适应度函数和分类说话人。已通过使用嘈杂的英语语音/语音信号评估了开发系统的性能。测试结果表明,该系统可有效检测真实语音信号。说话人分类的正确分类率约为85%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号