首页> 外文会议>European Signal Processing Conference >Feature selection and model optimization for semi-supervised speaker spotting
【24h】

Feature selection and model optimization for semi-supervised speaker spotting

机译:半监督说话人发现的特征选择和模型优化

获取原文

摘要

We explore, experimentally, feature selection and optimization of stochastic model parameters for the problem of speaker spotting. Based on an initially identified segment of speech of a speaker, an iterative model refinement method is developed along with a latent variable mixture model so that segments of the same speaker are identified in a long speech record. It is found that a GMM with moderate number of mixtures is better suited for the task than a large number mixture model as used in speaker identification. Similarly, a PCA based low-dimensional projection of MFCC based feature vector provides better performance. We show that about 6 seconds of initially identified speaker data is sufficient to achieve > 90% performance of speaker segment identification.
机译:我们通过实验探索随机模型参数的特征选择和优化,以解决说话人发现问题。基于说话者的最初识别出的语音片段,开发了一种迭代模型细化方法以及一个潜在变量混合模型,以便在长语音记录中识别出同一说话者的片段。结果发现,与中等数量的混合气模型相比,说话人识别中使用的大量混合气模型更适合该任务。类似地,基于MFCC的特征向量的基于PCA的低维投影提供了更好的性能。我们显示,大约6秒钟的初始识别说话者数据足以实现> 90%的说话者片段识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号