IEICE Transactions on Information and Systems

Robust Speaker Identification System Based on Multilayer Eigen-Codebook Vector Quantization


Abstract

This paper presents some effective methods for improving the performance of a speaker identification system. Based on the multiresolution property of the wavelet transform, the input speech signal is decomposed into various frequency subbands so that noise distortions do not spread over the entire feature space. To capture the characteristics of the vocal tract, the linear predictive cepstral coefficients (LPCC) of the lower-frequency subband at each decomposition level are calculated. In addition, a hard-threshold technique is applied to the lower-frequency subband at each decomposition level to eliminate the effect of noise interference. Furthermore, cepstral-domain feature vector normalization is applied to all computed features in order to provide similar parameter statistics in all acoustic environments. To effectively utilize all these multiband speech features, we propose a modified vector quantization as the identifier. This model uses the multilayer concept to eliminate the interference among the multiband speech features and then uses the principal component analysis (PCA) method to evaluate the codebooks, capturing a more detailed distribution of the speaker's phoneme characteristics. The proposed method is evaluated on the KING speech database for text-independent speaker identification. Experimental results show that the recognition performance of the proposed method is better than that of vector quantization (VQ) and the Gaussian mixture model (GMM) using full-band LPCC and mel-frequency cepstral coefficient (MFCC) features, in both clean and noisy environments. Satisfactory performance can also be achieved in low-SNR environments.
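The abstract outlines a concrete feature pipeline: DWT subband decomposition, hard thresholding of the lower-frequency band, LPCC extraction, cepstral-domain normalization, and a PCA-derived eigen-codebook. The following is a minimal Python sketch of that pipeline, not the authors' implementation; the wavelet family (db4), threshold value, LPC/cepstrum orders, frame layout, and number of retained eigenvectors are illustrative assumptions, and the single per-speaker PCA summary only stands in for the paper's multilayer eigen-codebook VQ.

```python
# Minimal sketch of the multiband feature pipeline described in the abstract.
# All parameter choices (wavelet, threshold, orders, frame size) are assumptions.
import numpy as np
import pywt                               # PyWavelets, for the DWT
from scipy.linalg import solve_toeplitz


def lpcc(frame, lpc_order=12, n_ceps=12):
    """LPC via the autocorrelation method, then the LPC-to-cepstrum recursion."""
    frame = frame * np.hamming(len(frame))
    r = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    a = solve_toeplitz(r[:lpc_order], r[1:lpc_order + 1])   # LPC coefficients
    c = np.zeros(n_ceps)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= lpc_order else 0.0
        for k in range(1, n):
            if n - k <= lpc_order:
                acc += (k / n) * c[k - 1] * a[n - k - 1]
        c[n - 1] = acc
    return c


def multiband_lpcc(frame, levels=2, threshold=0.02):
    """At each DWT level, hard-threshold the lower-frequency (approximation)
    subband and extract LPCC from it."""
    feats, approx = [], frame
    for _ in range(levels):
        approx, _detail = pywt.dwt(approx, "db4")
        approx = pywt.threshold(approx, threshold, mode="hard")
        feats.append(lpcc(approx))
    return np.concatenate(feats)


def cepstral_normalize(features):
    """Per-dimension mean/variance normalization over all frames (CMVN-style)."""
    return (features - features.mean(axis=0)) / (features.std(axis=0) + 1e-8)


def eigen_codebook(features, n_components=8):
    """Toy 'eigen-codebook': leading eigenvectors of the feature covariance,
    standing in for the paper's PCA-evaluated multilayer codebooks."""
    centered = features - features.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(centered, rowvar=False))
    order = np.argsort(eigvals)[::-1][:n_components]
    return features.mean(axis=0), eigvecs[:, order]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    speech = rng.standard_normal(16000)      # stand-in for one utterance at 16 kHz
    frames = speech.reshape(-1, 400)         # 25 ms frames, no overlap
    feats = np.vstack([multiband_lpcc(f) for f in frames])
    feats = cepstral_normalize(feats)
    mean_vec, basis = eigen_codebook(feats)
    print(feats.shape, basis.shape)          # (40, 24) (24, 8)
```

In the full system each speaker would hold a multilayer codebook per subband and identification would select the speaker whose codebook yields the lowest quantization distortion; the sketch only illustrates how the multiband features and an eigen-based summary could be computed.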
