首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
【24h】

Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization

机译:会话内扬声器内变异的无监督补偿

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a novel framework for unsupervised compensation of intra-session intra-speaker variability in the context of speaker diarization. Audio files are parameterized by sequences of GMM-supervectors representing overlapping short segments of speech. Session-dependent intra-session intra-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection (NAP) method. The proposed compensation method is evaluated in the context of speaker diarization in two-speaker conversations. A simple and effective two-speaker diarization algorithm is introduced in which speaker diarization is performed in the compensated supervector-space. The proposed diarization algorithm was evaluated on summed telephone conversations and achieved a speaker error rate of 2.8% which is a 54% relative error reduction compared to a baseline BIC-based system. Finally, we evaluate the proposed system on a speaker recognition task in the summed-speech condition where improvement in speaker recognition accuracy is observed using the proposed diarization system.
机译:本文提出了一种新颖的框架,用于在说话者差异化的背景下进行会话内说话者内部可变性的无监督补偿。音频文件由代表语音重叠短片段的GMM超向量序列进行参数化。依赖于会话的会话内说话者内部的变异性是以无监督的方式估算的,并使用讨厌的属性投影(NAP)方法进行补偿。在两个说话者的对话中,在说话者差异化的背景下对提出的补偿方法进行了评估。提出了一种简单有效的二扬声器二值化算法,该算法在补偿的超向量空间中进行二值化。在总的电话交谈中评估了所提出的差异化算法,该算法实现了2.8%的说话人错误率,与基于基线BIC的系统相比,相对错误率降低了54%。最后,我们在求和语音条件下评估提出的系统对说话人识别任务的影响,其中使用提出的差分系统观察到说话人识别准确性的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号