A new optimum feature extraction and classification method for speaker recognition: GWPNN

Engin Avci

首页> 外文期刊>Expert Systems with Application >A new optimum feature extraction and classification method for speaker recognition: GWPNN

【24h】

A new optimum feature extraction and classification method for speaker recognition: GWPNN

机译：一种新的说话人识别最佳特征提取与分类方法：GWPNN

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech and speaker recognition is an important topic to be performed by a computer system. In this paper, an expert speaker recognition system based on optimum wavelet packet entropy is proposed for speaker recognition by using real speech/voice signal. This study contains both the combination of the new feature extraction and classification approach by using optimum wavelet packet entropy parameter values. These optimum wavelet packet entropy values are obtained from measured real English language speech/voice signal waveforms using speech experimental set. A genetic-wavelet packet-neural network (GWPNN) model is developed in this study. GWPNN includes three layers which are genetic algorithm, wavelet packet and multi-layer perception. The genetic algorithm layer of GWPNN is used for selecting the feature extraction method and obtaining the optimum wavelet entropy parameter values. In this study, one of the four different feature extraction methods is selected by using genetic algorithm. Alternative feature extraction methods are wavelet packet decomposition, wavelet packet decomposition - short-time Fourier transform, wavelet packet decomposition - Born-Jordan time-frequency representation, wavelet packet decomposition - Choi-Williams time-frequency representation. The wavelet packet layer is used for optimum feature extraction in the time-frequency domain and is composed of wavelet packet decomposition and wavelet packet entropies. The multi-layer perceptron of GWPNN, which is a feed-forward neural network, is used for evaluating the fitness function of the genetic algorithm and for classification speakers. The performance of the developed system has been evaluated by using noisy English speech/voice signals. The test results showed that this system was effective in detecting real speech signals. The correct classification rate was about 85% for speaker classification.

机译：语音和说话者识别是计算机系统要执行的重要主题。本文提出了一种基于最优小波包熵的专家说话人识别系统，用于基于真实语音/语音信号的说话人识别。这项研究通过使用最佳小波包熵参数值，将新特征提取和分类方法结合在一起。这些最佳的小波包熵值是使用语音实验集从实测英语语音/语音信号波形中获得的。本研究建立了遗传小波包神经网络（GWPNN）模型。 GWPNN包括遗传算法，小波包和多层感知三层。 GWPNN的遗传算法层用于选择特征提取方法并获得最佳小波熵参数值。在这项研究中，使用遗传算法选择了四种不同的特征提取方法之一。可选的特征提取方法是小波包分解，小波包分解-短时傅立叶变换，小波包分解-Born-Jordan时频表示，小波包分解-Choi-Williams时频表示。小波包层用于时频域的最优特征提取，由小波包分解和小波包熵组成。 GWPNN的多层感知器是一种前馈神经网络，用于评估遗传算法的适应度函数和分类说话人。已通过使用嘈杂的英语语音/语音信号评估了开发系统的性能。测试结果表明，该系统可有效检测真实语音信号。说话人分类的正确分类率约为85％。

著录项

来源
《Expert Systems with Application》 |2007年第2期|p.485-498|共14页
作者
Engin Avci;
展开▼
作者单位

Firat University, Technical Education Faculty, Department of Electronic and Computer Science, 23119 Elazig, Turkey;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
english speech signal; adaptive feature extraction; wavelet packet decomposition; entropy; genetic algorithm; wavelet packet-neural networks; expert system;

机译：英语语音信号自适应特征提取小波包分解熵遗传算法小波包神经网络专家系统;

相似文献

外文文献
中文文献
专利

1. Optimal feature extraction methods for classification methods and their applications to biometric recognition [J] . Yin Jun, Zeng Weiming, Wei Lai Knowledge-Based Systems . 2016,第may1期

机译：分类方法的最佳特征提取方法及其在生物识别中的应用
2. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition [J] . Ferras M., Cheung-Chi Leung, Barras C., Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第6期

机译：基于SVM的说话人识别中说话人自适应方法作为特征提取的比较
3. Feature Extraction Methods for Speaker Recognition: A Review [J] . Chaudhary Gopal, Srivastava Smriti, Bhardwaj Saurabh International Journal of Pattern Recognition and Artificial Intelligence . 2017,第12期

机译：说话人识别的特征提取方法综述
4. Feature extraction and classification techniques for speaker recognition: A review [C] . Dhameliya Kinnal, Bhatt Ninad International Conference on Electrical, Electronics, Signals, Communication and Optimization . 2015

机译：用于说话人识别的特征提取和分类技术：综述
5. Physiologically-motivated feature extraction methods for speaker recognition. [D] . Wang, Jianglin. 2013

机译：用于说话人识别的生理动机特征提取方法。
6. Research of Recognition Method of Discrete Wavelet Feature Extraction and PNN Classification of Rats FT-IR Pancreatic Cancer Data [O] . Chayan Wan, Wenqing Cao, Cungui Cheng 2014

机译：大鼠FT-IR胰腺癌数据离散小波特征提取与PNN分类识别方法研究
7. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition [O] . M Ferras, C Barras, J.-L. Gauvain 2010

机译：扬声器适应方法与基于SVM的扬声器识别特征提取的比较
8. Classification Methods for Speaker Recognition. [R] . Sturim, D. E., Campbell, W. M., Reynolds, D. A. 2005

机译：说话人识别的分类方法。

A new optimum feature extraction and classification method for speaker recognition: GWPNN

摘要

著录项

相似文献

相关主题

期刊订阅