Binary quantization of feature vectors for robust text-independent speaker identification

Zhong-Xuan Yuan; Bo-Ling Xu

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Binary quantization of feature vectors for robust text-independent speaker identification

【24h】

Binary quantization of feature vectors for robust text-independent speaker identification

机译：特征向量的二进制量化，用于鲁棒的与文本无关的说话人识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a novel approach to vector quantization in which a feature vector is represented by a binary vector. It is called binary quantization (BQ). The performance criterion of vector quantization, distortion (distance) measure, was employed for investigating the effectiveness of BQ. At 12 b/analysis frame, the average distortion caused by BQ is even lower than the intraspeaker average distance between two repetitions of the same word (after DTW alignment). Since the output of BQ is a binary sequence, it is possible to combine it with a forward Hamming net classifier. In terms of the idea of a hierarchical model for describing a speaker individual characteristics, a text-independent speaker identification system was set up. Experimental results show that the performance of this system is very good. Not only are the small memory space and little computation required, in the speaker identification system, but, more importantly, it shows strong robustness in additive Gaussian white noise.

机译：我们提出了一种新颖的矢量量化方法，其中特征矢量由二进制矢量表示。这称为二进制量化（BQ）。使用矢量量化，失真（距离）度量的性能标准来研究BQ的有效性。在12 b /分析帧下，由BQ引起的平均失真甚至低于同一单词的两次重复之间的扬声器内平均距离（DTW对齐后）。由于BQ的输出是二进制序列，因此可以将其与正向汉明网络分类器组合。根据用于描述说话者个人特征的分层模型的思想，建立了独立于文本的说话者识别系统。实验结果表明，该系统的性能非常好。说话人识别系统不仅需要很小的存储空间和很少的计算量，而且更重要的是，它在加性高斯白噪声中显示出强大的鲁棒性。

著录项

来源
《IEEE Transactions on Speech and Audio Proceeding》 |1999年第1期|P.70-78|共9页
作者
Zhong-Xuan Yuan; Bo-Ling Xu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Binary quantization of feature vectors for robust text-independentspeaker identification [J] . Zhong-Xuan Yuan, Bo-Ling Xu, Chong-Zhi Yu IEEE Transactions on Speech and Audio Proceessing . 1999,第1期

机译：特征向量的二进制量化，可实现可靠的与文本无关的说话人识别
2. Text-independent speaker identification based on selection of the most similar feature vectors [J] . Mohammad Soleymanpour, Hossein Marvi International journal of speech technology . 2017,第1期

机译：基于最相似特征向量的选择的与文本无关的说话人识别
3. Adaptive wavelet thresholding with robust hybrid features for text-independent speaker identification system [J] . Hesham A. Alabbasi, Ali M. Jalil, Fadhil S. Hasan International Journal of Electrical and Computer Engineering . 2020,第5期

机译：具有鲁棒混合特性的自适应小波阈值，用于独立于文本的扬声器识别系统
4. Hybridization process for text-independent speaker identification based on vector quantization model [C] . Mohammed Djeghader, Qin Huang 2016 IEEE International Conference on Signal and Image Processing . 2016

机译：基于矢量量化模型的文本无关说话人识别的混合过程
5. Robust features for speaker identification. [D] . Assaleh, Khaled Talal. 1993

机译：强大的扬声器识别功能。
6. Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles [O] . Soo Jin Park, Gary Yeung, Neda Vesselinova, -1

机译：旨在理解人和机器中说话者的辨别能力以实现不同语音风格的与文本无关的简短发声
7. Comparison of Vector Quantization and Gaussian Mixture Model using Effective MFCC Features for Text-independent Speaker Identification [O] . S. B., S. M. 2016

机译：矢量量化与高斯混合模型使用有效MFCC特征对文本无关的扬声器识别的比较

Binary quantization of feature vectors for robust text-independent speaker identification

摘要

著录项

相似文献

相关主题

期刊订阅