首页> 外文会议>International Joint Conference on Natural Language Processing >Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model
【24h】

Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model

机译:基于小波变换和高斯混合模型的强大扬声器识别系统

获取原文

摘要

This paper presents an effective method for improving the performance of a speaker identification system. Based on the multiresolution property of the wavelet transform, the input speech signal is decomposed into various frequency bands in order not to spread noise distortions over the entire feature space. The linear predictive cepstral coefficients (LPCCs) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features. We use feature recombination and likelihood recombination methods to evaluate the task of the text-independent speaker identification. The feature recombination scheme combines the cepstral coefficients of each band to form a single feature vector used to train the Gaussian mixture model (GMM). The likelihood recombination scheme combines the likelihood scores of independent GMM for each band. Experimental results show that both proposed methods outperform the GMM model using full-band LPCCs and mel-frequency cepstral coefficients (MFCCs) in both clean and noisy environments.
机译:本文介绍了提高扬声器识别系统性能的有效方法。基于小波变换的多分辨率特性,输入语音信号被分解成各种频带,以便在整个特征空间上不扩展噪声失真。计算每个频带的线性预测临床系数(LPCC)。此外,临床均值归一化技术应用于所有计算的特征。我们使用特征重组和似然重组方法来评估独立于文本的扬声器识别的任务。特征重组方案组合每个频带的倒谱系数以形成用于训练高斯混合模型(GMM)的单个特征向量。似然重组方案结合了每个频段的独立GMM的可能性分数。实验结果表明,两个提出的方法在干净和嘈杂的环境中使用全带LPCC和MET频率患者系数(MFCC)优于GMM模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号