首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Factor analysis-based approaches applied to the speaker diarization task of meetings: a preliminary study
【24h】

Factor analysis-based approaches applied to the speaker diarization task of meetings: a preliminary study

机译:基于因子分析的方法适用于会议发言人的差异化任务:初步研究

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a preliminary study on the use of the Factor Analysis (FA) methods in an automatic speaker diarization process, dedicated to the meeting rooms. Indeed, the speaker diarization process, based on the top-down E-HMM approach, integrates a FA-based speaker modeling in an additional resegmentation step, which aims at helping the refinement of the output segmentation. Classically applied in speaker recognition to deal with channel variability issues, two main schemes of the FA application are proposed here: to deal with the (1) inter-speaker variability and with (2) the inter-segment variability. Different kinds of experiments have been conducted on the dataset of the last NIST/RT'09 evaluation campaign, leading to very interesting and promising results. For instance, they show that the couple of schemes proposed in this paper obtained competitive performance, compared to the baseline process, despite the small amount of development data used in this paper for the FA parameter estimation. Unexpectedly, they tend to show that the inter-segment variability component can be helpful for speaker diarization.
机译:本文介绍了针对会议室专用的自动扬声器二值化过程中因素分析(FA)方法的使用的初步研究。的确,基于自上而下的E-HMM方法的说话人区分过程将基于FA的说话人建模集成到一个额外的细分步骤中,该步骤旨在帮助优化输出细分。在语音识别中经典地用于处理信道可变性的问题,在此提出了FA应用的两个主要方案:处理(1)说话者间可变性和(2)段间可变性。在最近的NIST / RT'09评估活动的数据集上进行了各种实验,得出了非常有趣和有希望的结果。例如,他们表明,尽管本文中用于FA参数估计的开发数据量很少,但与基线过程相比,本文提出的两种方案仍具有竞争优势。出乎意料的是,他们倾向于表明段间可变性成分可能有助于说话人区分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号