首页> 外文期刊>Speech Communication >A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification
【24h】

A tree-based kernel selection approach to efficient Gaussian mixture model–universal background model based speaker identification

机译:基于树的核选择方法基于高斯混合模型-基于通用背景模型的说话人识别

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a tree-based kernel selection (TBKS) algorithm as a computationally efficient approach to the Gaussian mixture model–universal background model (GMM–UBM) based speaker identification. All Gaussian components in the universal background model are first clustered hierarchically into a tree and the corresponding acoustic space is mapped into structurally partitioned regions. When identifying a speaker, each test input feature vector is scored against a small subset of all Gaussian components. As a result of this TBKS process, computation complexity can be significantly reduced. We improve the efficiency of the proposed system further by applying a previously proposed observation reordering based pruning (ORBP) to screen out unlikely candidate speakers. The approach is evaluated on a speech database of 1031 speakers, in both clean and noisy conditions. The experimental results show that by integrating TBKS and ORBP together we can speed up the computation efficiency by a factor of 15.8 with only a very slight degradation of identification performance, i.e., an increase of 1% of relative error rate, compared with a baseline GMM–UBM system. The improved search efficiency is also robust to additive noise.
机译:我们提出了一种基于树的核选择(TBKS)算法,作为基于高斯混合模型-通用背景模型(GMM-UBM)的说话人识别的一种高效计算方法。通用背景模型中的所有高斯分量首先被层次化地聚集成一棵树,并且相应的声学空间被映射到结构上划分的区域。识别说话者时,将对所有高斯成分的一小部分对每个测试输入特征向量进行评分。 TBKS处理的结果是,可以显着降低计算复杂度。我们通过应用以前提出的基于观察重排序的修剪(ORBP)来筛选不太可能的候选发言者,从而进一步提高了提出的系统的效率。在干净和嘈杂的条件下,该方法在1031个扬声器的语音数据库中进行了评估。实验结果表明,通过将TBKS和ORBP集成在一起,我们可以将计算效率提高15.8倍,而识别性能却只有很小的下降,即与基线GMM相比,相对错误率提高了1% –UBM系统。改进的搜索效率还可以抵抗加性噪声。

著录项

  • 来源
    《Speech Communication》 |2006年第10期|p. 1273-1282|共10页
  • 作者单位

    Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

    Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

    Beijing d-Ear Technologies Co., Ltd,1 China;

    Microsoft Research Asia,2 China;

    Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 语言、文字;
  • 关键词

    Speaker recognition; Speaker identification; Tree-based kernel selection; GMM–UBM;

    机译:说话人识别;说话人识别;基于树的内核选择;GMM–UBM;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号