首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Binary quantization of feature vectors for robust text-independent speaker identification
【24h】

Binary quantization of feature vectors for robust text-independent speaker identification

机译:特征向量的二进制量化,用于鲁棒的与文本无关的说话人识别

获取原文
获取原文并翻译 | 示例
           

摘要

We present a novel approach to vector quantization in which a feature vector is represented by a binary vector. It is called binary quantization (BQ). The performance criterion of vector quantization, distortion (distance) measure, was employed for investigating the effectiveness of BQ. At 12 b/analysis frame, the average distortion caused by BQ is even lower than the intraspeaker average distance between two repetitions of the same word (after DTW alignment). Since the output of BQ is a binary sequence, it is possible to combine it with a forward Hamming net classifier. In terms of the idea of a hierarchical model for describing a speaker individual characteristics, a text-independent speaker identification system was set up. Experimental results show that the performance of this system is very good. Not only are the small memory space and little computation required, in the speaker identification system, but, more importantly, it shows strong robustness in additive Gaussian white noise.
机译:我们提出了一种新颖的矢量量化方法,其中特征矢量由二进制矢量表示。这称为二进制量化(BQ)。使用矢量量化,失真(距离)度量的性能标准来研究BQ的有效性。在12 b /分析帧下,由BQ引起的平均失真甚至低于同一单词的两次重复之间的扬声器内平均距离(DTW对齐后)。由于BQ的输出是二进制序列,因此可以将其与正向汉明网络分类器组合。根据用于描述说话者个人特征的分层模型的思想,建立了独立于文本的说话者识别系统。实验结果表明,该系统的性能非常好。说话人识别系统不仅需要很小的存储空间和很少的计算量,而且更重要的是,它在加性高斯白噪声中显示出强大的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号