Speaker identification using multimodal neural networks and wavelet analysis

Almaadeed Noor; Aggoun Amar; Amira Abbes

首页> 外文期刊>Biometrics, IET >Speaker identification using multimodal neural networks and wavelet analysis

【24h】

Speaker identification using multimodal neural networks and wavelet analysis

机译：使用多模态神经网络和小波分析的说话人识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The rapid momentum of the technology progress in the recent years has led to a tremendous rise in the use of biometric authentication systems. The objective of this research is to investigate the problem of identifying a speaker from its voice regardless of the content. In this study, the authors designed and implemented a novel text-independent multimodal speaker identification system based on wavelet analysis and neural networks. Wavelet analysis comprises discrete wavelet transform, wavelet packet transform, wavelet sub-band coding and Mel-frequency cepstral coefficients (MFCCs). The learning module comprises general regressive, probabilistic and radial basis function neural networks, forming decisions through a majority voting scheme. The system was found to be competitive and it improved the identification rate by 15% as compared with the classical MFCC. In addition, it reduced the identification time by 40% as compared with the back-propagation neural network, Gaussian mixture model and principal component analysis. Performance tests conducted using the GRID database corpora have shown that this approach has faster identification time and greater accuracy compared with traditional approaches, and it is applicable to real-time, text-independent speaker identification systems.

机译：近年来，技术进步的迅速发展导致生物识别系统的使用大大增加。这项研究的目的是研究从语音中识别说话者而不考虑其内容的问题。在这项研究中，作者设计和实现了一种基于小波分析和神经网络的新型独立于文本的多模式说话人识别系统。小波分析包括离散小波变换，小波包变换，小波子带编码和梅尔频率倒谱系数（MFCC）。学习模块包括通用回归，概率和径向基函数神经网络，通过多数表决方案形成决策。该系统具有竞争优势，与经典MFCC相比，它的识别率提高了15％。此外，与反向传播神经网络，高斯混合模型和主成分分析相比，它可以将识别时间减少40％。使用GRID数据库语料库进行的性能测试表明，与传统方法相比，该方法具有更快的识别时间和更高的准确性，并且适用于实时的，独立于文本的说话者识别系统。

著录项

来源
《Biometrics, IET》 |2015年第1期|18-28|共11页
作者
Almaadeed Noor; Aggoun Amar; Amira Abbes;
展开▼
作者单位

Dept. of Comput. Eng., Brunel Univ., Uxbridge, UK;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Gaussian processes; audio databases; backpropagation; biometrics (access control); cepstral analysis; discrete wavelet transforms; mixture models; principal component analysis; radial basis function networks; speaker recognition; text analysis; GRID database corpora; Gaussian mixture model; MFCC; Mel-frequency cepstral coefficients; back-propagation neural network; biometric authentication systems; discrete wavelet transform; general regressive neural networks; learning module; majority voting scheme; multimodal neural networks; principal component analysis; probabilistic neural networks; radial basis function neural networks; text-independent multimodal speaker identification system; wavelet analysis; wavelet packet transform; wavelet subband coding;

机译：高斯过程;音频数据库;反向传播;生物计量学（访问控制）;倒谱分析;离散小波变换;混合模型;主成分分析;径向基函数网络;扬声器识别;文本分析;GRID数据库语料库;高斯混合模型;MFCC;Mel频倒谱系数;反向传播神经网络;生物认证系统;离散小波变换;通用回归神经网络;学习模块;多数投票方案;多峰神经网络;主成分分析;概率神经网络;径向基函数神经网络;文本独立多模态说话人识别系统;小波分析;小波包变换;小波子带编码;
入库时间 2022-08-17 23:56:18

相似文献

外文文献
中文文献
专利

1. Speaker identification using neural networks and wavelets [J] . Phan F., Micheli-Tzanakou E. IEEE Engineering in Medicine and Biology Magazine . 2000,第1期

机译：使用神经网络和小波进行说话人识别
2. Speaker identification using vowels features through a combined method of formants, wavelets, and neural network classifiers [J] . Daqrouq Khaled, Tutunji Tarek A. Applied Soft Computing . 2015,第Null期

机译：通过共振峰，小波和神经网络分类器的组合方法，使用元音特征识别说话人
3. The Use of Wavelets in Speaker Feature Tracking Identification System Using Neural Network [J] . WAEL AL-SAWALMEH, KHALED DAQROUQ, ABDEL-RAHMAN AL-QAWASMI, WSEAS Transactions on Signal Processing . 2009,第4a6期

机译：小波在神经网络说话人特征跟踪识别系统中的应用
4. Speaker identification with wavelet decomposition and neural networks [C] . Phan, F., Micheli-Tzanakou, . 1994

机译：小波分解与神经网络的说话人识别
5. Analysis of fetal heart rate variability and stress conditions in fetuses using wavelets and neural networks (ALOPEX): A feasibility study. [D] . Akay, Yasemin Munevver. 1999

机译：利用小波和神经网络（ALOPEX）分析胎儿的胎儿心率变异性和应激状况：一项可行性研究。
6. Atrial Fibrillation Beat Identification Using the Combination of Modified Frequency Slice Wavelet Transform and Convolutional Neural Networks [O] . Xiaoyan Xu, Shoushui Wei, Caiyun Ma, 2018

机译：结合改进的频率切片小波变换和卷积神经网络的心房颤动搏动识别
7. Speaker identification using multimodal neural networks and wavelet analysis [O] . Aggoun, Amar, Almaadeed, Noor, Amira, Abbes 2015

机译：使用多模态神经网络和小波分析的说话人识别
8. Genetically Optimised Feedforward Neural Networks for Speaker Identification [R] . Price, R. , Willmore, J. , Roberts, W. 1999

机译：用于说话人识别的遗传优化前馈神经网络

Speaker identification using multimodal neural networks and wavelet analysis

摘要

著录项

相似文献

相关主题

期刊订阅