首页> 外文期刊>Biometrics, IET >Speaker identification using multimodal neural networks and wavelet analysis
【24h】

Speaker identification using multimodal neural networks and wavelet analysis

机译:使用多模态神经网络和小波分析的说话人识别

获取原文
获取原文并翻译 | 示例
       

摘要

The rapid momentum of the technology progress in the recent years has led to a tremendous rise in the use of biometric authentication systems. The objective of this research is to investigate the problem of identifying a speaker from its voice regardless of the content. In this study, the authors designed and implemented a novel text-independent multimodal speaker identification system based on wavelet analysis and neural networks. Wavelet analysis comprises discrete wavelet transform, wavelet packet transform, wavelet sub-band coding and Mel-frequency cepstral coefficients (MFCCs). The learning module comprises general regressive, probabilistic and radial basis function neural networks, forming decisions through a majority voting scheme. The system was found to be competitive and it improved the identification rate by 15% as compared with the classical MFCC. In addition, it reduced the identification time by 40% as compared with the back-propagation neural network, Gaussian mixture model and principal component analysis. Performance tests conducted using the GRID database corpora have shown that this approach has faster identification time and greater accuracy compared with traditional approaches, and it is applicable to real-time, text-independent speaker identification systems.
机译:近年来,技术进步的迅速发展导致生物识别系统的使用大大增加。这项研究的目的是研究从语音中识别说话者而不考虑其内容的问题。在这项研究中,作者设计和实现了一种基于小波分析和神经网络的新型独立于文本的多模式说话人识别系统。小波分析包括离散小波变换,小波包变换,小波子带编码和梅尔频率倒谱系数(MFCC)。学习模块包括通用回归,概率和径向基函数神经网络,通过多数表决方案形成决策。该系统具有竞争优势,与经典MFCC相比,它的识别率提高了15%。此外,与反向传播神经网络,高斯混合模型和主成分分析相比,它可以将识别时间减少40%。使用GRID数据库语料库进行的性能测试表明,与传统方法相比,该方法具有更快的识别时间和更高的准确性,并且适用于实时的,独立于文本的说话者识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号