...
首页> 外文期刊>Circuits, systems, and signal processing >Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm
【24h】

Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

机译:使用修改的VQ-LBG算法在清洁和嘈杂的背景中独立于文本的扬声器识别

获取原文
获取原文并翻译 | 示例

摘要

Speaker recognition is the process of identifying the proper speaker by analyzing the spectral shape of the speech signal. This process is done by extracting the desired features and matching the features of the speech signal. In this paper, we adopted the Mel frequency cepstrum coefficient (MFCC) technique for extracting the features from the speaker speech sample. These cepstrum coefficients are named as extracted features. The extracted MFCC features are given as input to the modified vector quantization via Linde-Buzo-Gray (modified VQ-LBG) process and expectation maximization (EM) algorithm. Vector quantization technique is mainly used for feature matching where a separate codebook will be generated for each speaker. The EM algorithm is utilized to develop the Gaussian mixture model-universal background model (GMM-UBM). In GMM-UBM model, k means cluster is summed up to consolidate data about the covariance structure of the information and the focuses of the inert Gaussians. From our analysis, the modified VQ-LBG algorithm gives better performance compared to the GMM-UBM model.
机译:扬声器识别是通过分析语音信号的光谱形状来识别适当扬声器的过程。通过提取所需特征并匹配语音信号的特征来完成该过程。在本文中,我们采用MEL频率谱系码(MFCC)技术从扬声器语音样本中提取特征。这些Cepstrum系数被命名为提取的功能。提取的MFCC特征通过Linde-Buzo-灰度(修改的VQ-LBG)处理和期望最大化(EM)算法作为改进的矢量量化的输入。向量量化技术主要用于特征匹配,其中将为每个扬声器生成单独的码本。利用EM算法开发高斯混合模型 - 通用背景模型(GMM-UBM)。在GMM-UBM模型中,k表示集群总结,以巩固关于信息协方差结构的数据和惰性高斯的焦点。从我们的分析中,与GMM-UBM模型相比,修改的VQ-LBG算法提供了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号