Journal of Circuits, Systems, and Computers

NOVEL DISCRIMINATIVE VECTOR QUANTIZATION APPROACH FOR SPEAKER IDENTIFICATION



Abstract

A novel Discriminative Vector Quantization method for Speaker Identification (DVQSI) is proposed, and the selection of its parameters is discussed. In the training mode, the vector space of speech features is divided into a number of regions, and a Vector Quantization (VQ) codebook is constructed for each speaker in each region. For every pair of speakers, each region is assigned a discriminative weight based on its ability to discriminate between the two speakers. A region in which the distributions of the two speakers' feature vectors differ more receives a larger weight, and therefore plays a more important role in deciding which of the two speakers is the better match. In the testing mode, to identify an unknown speaker, pairs of discriminatively weighted average VQ distortions are computed for the unknown speaker's input waveform, and a technique is described that determines the best match between the unknown waveform and the speakers' templates. The proposed DVQSI approach can be considered a generalization of the existing VQ technique for Speaker Identification (VQSI). By employing the discriminative weights and the space segmentation as design parameters, the method yields better Speaker Identification (SI) accuracy, which is confirmed experimentally. In addition, a computationally efficient implementation of DVQSI is given, which uses a tree-structured-like approach to obtain the codebooks.
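The abstract gives no formulas, so the pipeline it describes can only be illustrated loosely. The following is a minimal sketch, not the authors' implementation. Everything concrete in it is an assumption made for illustration: the regions are found by k-means on the pooled training vectors, codebooks are trained with plain k-means rather than the tree-structured-like procedure, a region's weight for a speaker pair is the mean absolute difference between the two speakers' codebook distortions in that region, and the final decision is a pairwise vote. The names `train`, `identify`, `kmeans`, and the toy 2-D data are hypothetical.

```python
import random
from itertools import combinations

def dist2(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(v, centres):
    """Index of the closest centre and the squared distance to it."""
    d = [dist2(v, c) for c in centres]
    i = min(range(len(d)), key=d.__getitem__)
    return i, d[i]

def kmeans(vectors, k, iters=20, seed=0):
    """Plain Lloyd iterations (stand-in for the paper's codebook design)."""
    centres = random.Random(seed).sample(vectors, min(k, len(vectors)))
    for _ in range(iters):
        groups = [[] for _ in centres]
        for v in vectors:
            groups[nearest(v, centres)[0]].append(v)
        centres = [[sum(col) / len(g) for col in zip(*g)] if g else c
                   for g, c in zip(groups, centres)]
    return centres

def avg_distortion(vectors, book):
    """Average VQ distortion of a set of vectors against one codebook."""
    return sum(nearest(v, book)[1] for v in vectors) / len(vectors)

def train(data, n_regions=2, codebook_size=2):
    """data maps each speaker to a list of feature vectors."""
    pooled = [v for vs in data.values() for v in vs]
    regions = kmeans(pooled, n_regions)            # assumed segmentation
    n = len(regions)
    books = {}
    for s, vs in data.items():                     # one codebook per speaker per region
        for r in range(n):
            in_r = [v for v in vs if nearest(v, regions)[0] == r]
            # Assumption: with no data in a region, fall back to all vectors.
            books[s, r] = kmeans(in_r or vs, codebook_size)
    weights = {}
    for a, b in combinations(data, 2):             # one weight vector per speaker pair
        raw = []
        for r in range(n):
            vs = [v for s in (a, b) for v in data[s]
                  if nearest(v, regions)[0] == r]
            # Assumed proxy for "distribution difference" in region r.
            raw.append(sum(abs(nearest(v, books[a, r])[1] -
                               nearest(v, books[b, r])[1]) for v in vs) / len(vs)
                       if vs else 0.0)
        total = sum(raw)
        weights[a, b] = [x / total for x in raw] if total else [1.0 / n] * n
    return regions, books, weights

def identify(x, regions, books, weights):
    """Pairwise vote on weighted average distortions (assumed decision rule)."""
    by_region = {}
    for v in x:
        by_region.setdefault(nearest(v, regions)[0], []).append(v)
    wins = {}
    for (a, b), w in weights.items():
        wins.setdefault(a, 0)
        wins.setdefault(b, 0)
        da = sum(w[r] * avg_distortion(vs, books[a, r]) for r, vs in by_region.items())
        db = sum(w[r] * avg_distortion(vs, books[b, r]) for r, vs in by_region.items())
        wins[a if da <= db else b] += 1
    return max(wins, key=wins.get)

# Toy 2-D "features" for three hypothetical speakers.
rng = random.Random(1)
centres = {"A": (0, 0), "B": (5, 5), "C": (10, 0)}

def sample(c, n):
    return [(c[0] + rng.uniform(-1, 1), c[1] + rng.uniform(-1, 1)) for _ in range(n)]

data = {s: sample(c, 30) for s, c in centres.items()}
model = train(data)
print(identify(sample(centres["B"], 10), *model))
```

With a single region and equal weights, this sketch reduces to comparing ordinary average VQ distortions, which is consistent with the abstract's remark that DVQSI generalizes VQSI. The toy data are deliberately well separated, so the example says nothing about accuracy on real speech features.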
