【24h】

Extending the Task of Diarization to Speaker Attribution

机译:将差异化的任务扩展到说话者归因

获取原文

摘要

In this paper we extend the concept of speaker annotation within a single-recording, or speaker diarization, to a collection wide approach we call speaker attribution. Accordingly, speaker attribution is the task of clustering expectantly homogenous inter-session clusters obtained using diarization according to common cross-recording identities. The result of attribution is a collection of spoken audio across multiple recordings attributed to speaker identities. In this paper, an attribution system is proposed using mean-only MAP adaptation of a combined-gender UBM to model clusters from a perfect diarization system, as well as a JFA-based system with session variability compensation. The normalized cross-likelihood ratio is calculated for each pair of clusters to construct an attribution matrix and the complete linkage algorithm is employed to conduct clustering of the inter-session clusters. A matched cluster purity and coverage of 87.1% was obtained on the NIST 2008 SRE corpus.
机译:在本文中,我们将单次录音或说话者二值化内的说话者注释概念扩展到了我们称为说话者归因的广泛收集方法中。因此,说话者归因是根据通用的交叉记录身份对使用均匀化获得的预期同质的会话间聚类进行聚类的任务。归因的结果是归因于说话者身份的多个录音中的语音音频集合。在本文中,提出了一种归因系统,该归因系统使用了组合性别UBM的平均均值MAP自适应方法,以从理想的离散化系统以及具有会话可变性补偿的基于JFA的系统中对集群进行建模。为每对集群计算归一化的交叉似然比,以构造一个归因矩阵,并采用完整的链接算法对会话间集群进行聚类。在NIST 2008 SRE语料库中获得了匹配的簇纯度和87.1%的覆盖率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号