A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification

Zhenyu Xiong; Thomas Fang Zheng; Zhanjiang Song; Frank Soong; Wenhu Wu

首页> 外文期刊>Speech Communication >A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification

【24h】

A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification

机译：基于树的核选择方法基于高斯混合模型-基于通用背景模型的说话人识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a tree-based kernel selection (TBKS) algorithm as a computationally efficient approach to the Gaussian mixture model–universal background model (GMM–UBM) based speaker identification. All Gaussian components in the universal background model are first clustered hierarchically into a tree and the corresponding acoustic space is mapped into structurally partitioned regions. When identifying a speaker, each test input feature vector is scored against a small subset of all Gaussian components. As a result of this TBKS process, computation complexity can be significantly reduced. We improve the efficiency of the proposed system further by applying a previously proposed observation reordering based pruning (ORBP) to screen out unlikely candidate speakers. The approach is evaluated on a speech database of 1031 speakers, in both clean and noisy conditions. The experimental results show that by integrating TBKS and ORBP together we can speed up the computation efficiency by a factor of 15.8 with only a very slight degradation of identification performance, i.e., an increase of 1% of relative error rate, compared with a baseline GMM–UBM system. The improved search efficiency is also robust to additive noise.

机译：我们提出了一种基于树的核选择（TBKS）算法，作为基于高斯混合模型-通用背景模型（GMM-UBM）的说话人识别的一种高效计算方法。通用背景模型中的所有高斯分量首先被层次化地聚集成一棵树，并且相应的声学空间被映射到结构上划分的区域。识别说话者时，将对所有高斯成分的一小部分对每个测试输入特征向量进行评分。 TBKS处理的结果是，可以显着降低计算复杂度。我们通过应用以前提出的基于观察重排序的修剪（ORBP）来筛选不太可能的候选发言者，从而进一步提高了提出的系统的效率。在干净和嘈杂的条件下，该方法在1031个扬声器的语音数据库中进行了评估。实验结果表明，通过将TBKS和ORBP集成在一起，我们可以将计算效率提高15.8倍，而识别性能却只有很小的下降，即与基线GMM相比，相对错误率提高了1％ –UBM系统。改进的搜索效率还可以抵抗加性噪声。

著录项

来源
《Speech Communication》 |2006年第10期|p. 1273-1282|共10页
作者
Zhenyu Xiong; Thomas Fang Zheng; Zhanjiang Song; Frank Soong; Wenhu Wu;
展开▼
作者单位

Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

Beijing d-Ear Technologies Co., Ltd,1 China;

Microsoft Research Asia,2 China;

Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类语言、文字;
关键词
Speaker recognition; Speaker identification; Tree-based kernel selection; GMM–UBM;

机译：说话人识别;说话人识别;基于树的内核选择;GMM–UBM;

相似文献

外文文献
中文文献
专利

1. A Model-Selection-Based Self-Splitting Gaussian Mixture Learning with Application to Speaker Identification [J] . Shih-Sian Cheng, Hsin-Min Wang, Hsin-Chia Fu EURASIP journal on advances in signal processing . 2004,第17期

机译：基于模型选择的自分裂高斯混合学习及其在说话人识别中的应用
2. An efficient scoring algorithm for Gaussian mixture model based speaker identification [J] . Pellom B.L., Hansen J.H.L. IEEE signal processing letters . 1998,第11期

机译：基于说话人识别的高斯混合模型的高效计分算法
3. Spoken Language Identification using Gaussian Mixture Model-Universal Background Model in Indian Context [J] . Sreedhar Potla, Vishnu Vardhan B. International Journal of Applied Engineering Research . 2018,第5aPta4期

机译：在印度语境中使用高斯混合模型 - 通用背景模型的口语语言识别
4. Automatic language identification based on Gaussian mixture model and universal background model [C] . Dan Qu, Bingxi Wang, Xin Wei Multispectral Image Processing and Pattern Recognition . 2003

机译：基于高斯混合模型和通用背景模型的语言自动识别
5. A software based speaker identification system using Gaussian mixture model classification. [D] . Reynolds, Ryan M. 2005

机译：使用高斯混合模型分类的基于软件的说话人识别系统。
6. Comparison of neuron-based, kernel-based, tree-based and curve-based machine learning models for predicting daily reference evapotranspiration [O] . Lifeng Wu, Junliang Fan 2015

机译：基于神经元，基于核，基于树和基于曲线的机器学习模型的比较，以预测每日参考蒸散量
7. Tree-based Gaussian mixture models for speaker verification [O] . Cilliers Francois Dirk 2005

机译：用于说话人验证的基于树的高斯混合模型
8. Efficient Speaker Verification Using Gaussian Mixture Model Component Clustering. [R] . McClanahan, R. D., De Leon, P. L. 2012

机译：使用高斯混合模型组件聚类的高效说话人验证。

A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification

摘要

著录项

相似文献

相关主题

期刊订阅