NOVEL DISCRIMINATIVE VECTOR QUANTIZATION APPROACH FOR SPEAKER IDENTIFICATION

GUANGYU ZHOU; WASFY B. MIKHAEL; BRENT MYERS

Abstract

A novel Discriminative Vector Quantization method for Speaker Identification (DVQSI) is proposed, and the selection of its parameters is discussed. In the training mode of this approach, the vector space of speech features is divided into a number of regions, and a Vector Quantization (VQ) codebook is constructed for each speaker in each region. For every possible speaker pair, each region is assigned a discriminative weight based on its ability to discriminate between the two speakers. A region in which the distributions of the two speakers' feature vector sets differ more receives a larger discriminative weight, and therefore plays a more important role in identifying which of the two speakers is the better match. In the testing mode, to identify an unknown speaker, discriminative weighted average VQ distortion pairs are computed for the unknown speaker's input waveform. A technique is then described that determines the best match between the unknown waveform and the speakers' templates. The proposed DVQSI approach can be considered a generalization of the existing VQ technique for Speaker Identification (VQSI). By employing the discriminative weights and the space segmentation as design parameters, the method yields better Speaker Identification (SI) accuracy, which is confirmed experimentally. In addition, a computationally efficient implementation of the DVQSI technique is given, which uses a tree-structured-like approach to obtain the codebooks.
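The abstract gives no formulas, so the following Python sketch is only one possible reading of the training and testing modes, not the authors' implementation. Everything specific in it is an assumption made for illustration: the regions come from k-means on the pooled training vectors, the codebooks are trained with plain k-means rather than the tree-structured-like approach, the discriminative weight is 1 plus the gap between the two speakers' shares of vectors in a region, and the final decision is a majority vote over pairwise comparisons. The names `train`, `identify`, and `toy_speaker` are hypothetical.

```python
import random
from itertools import combinations

def dist2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(v, centers):
    """Index of the center closest to v."""
    return min(range(len(centers)), key=lambda k: dist2(v, centers[k]))

def kmeans(vectors, k, iters=10, seed=0):
    """Plain k-means, used here as a stand-in codebook trainer."""
    rng = random.Random(seed)
    centers = rng.sample(vectors, min(k, len(vectors)))
    for _ in range(iters):
        groups = [[] for _ in centers]
        for v in vectors:
            groups[nearest(v, centers)].append(v)
        # Recompute each center as the mean of its group; keep it if the group is empty.
        centers = [[sum(c) / len(g) for c in zip(*g)] if g else centers[i]
                   for i, g in enumerate(groups)]
    return centers

def train(train_data, n_regions=2, codebook_size=2):
    """train_data maps a speaker label to a list of feature vectors."""
    pooled = [v for vecs in train_data.values() for v in vecs]
    regions = kmeans(pooled, n_regions)  # assumed space segmentation
    codebooks, share = {}, {}
    for s, vecs in train_data.items():
        by_region = [[] for _ in regions]
        for v in vecs:
            by_region[nearest(v, regions)].append(v)
        # Assumption: if a speaker has no vectors in a region, use all of them.
        codebooks[s] = [kmeans(g or vecs, codebook_size) for g in by_region]
        share[s] = [len(g) / len(vecs) for g in by_region]
    weights = {}
    for a, b in combinations(sorted(train_data), 2):
        # Assumed weight: grows with the gap in how the pair occupies the region.
        weights[(a, b)] = [1.0 + abs(pa - pb)
                           for pa, pb in zip(share[a], share[b])]
    return regions, codebooks, weights

def identify(test_vectors, model):
    """Return the speaker that wins the most pairwise comparisons."""
    regions, codebooks, weights = model
    wins = {s: 0 for s in codebooks}
    for (a, b), w in weights.items():
        total, norm = {a: 0.0, b: 0.0}, 0.0
        for v in test_vectors:
            r = nearest(v, regions)
            norm += w[r]
            for s in (a, b):
                total[s] += w[r] * min(dist2(v, c) for c in codebooks[s][r])
        # Weighted average VQ distortion pair for this speaker pair.
        da, db = total[a] / norm, total[b] / norm
        wins[a if da <= db else b] += 1
    return max(wins, key=wins.get)

def toy_speaker(center, n, seed):
    """Synthetic 'speaker': Gaussian feature vectors around a center."""
    rng = random.Random(seed)
    return [[c + rng.gauss(0, 0.3) for c in center] for _ in range(n)]
```

As a usage example, `train` can be called on a dictionary such as `{"A": toy_speaker([0, 0], 40, 1), "B": toy_speaker([3, 3], 40, 2)}` and the returned model passed to `identify` together with a list of test vectors. With `n_regions=1` every weight in this sketch equals 1, so the comparison reduces to ordinary average VQ distortion, which mirrors the abstract's remark that DVQSI generalizes VQSI.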

Bibliographic record

Source: Journal of Circuits, Systems, and Computers, 2005, No. 3, pp. 581-596 (16 pages)
Authors: GUANGYU ZHOU; WASFY B. MIKHAEL; BRENT MYERS
Affiliation: Department of Electrical and Computer Engineering, University of Central Florida, Orlando, FL 32826, USA
Original format: PDF
Language: English
Chinese Library Classification: computing technology, computer technology
Keywords: speaker identification; vector quantization; discriminative weight; feature space segmentation

