首页> 外文OA文献 >A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets
【2h】

A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets

机译:一种扬声器重拨方案,用于改善大型两个扬声器的电话数据集中的差异化

摘要

In this paper we propose a novel scheme for carrying out speaker diarization in an iterative manner. We aim to show that the information obtained through the first pass of speaker diarization can be reused to refine and improve the original diarization results. We call this technique speaker rediarization and demonstrate the practical application of our rediarization algorithm using a large archive of two-speaker telephone conversation recordings. We use the NIST 2008 SRE summed telephone corpora for evaluating our speaker rediarization system. This corpus contains recurring speaker identities across independent recording sessions that need to be linked across the entire corpus. We show that our speaker rediarization scheme can take advantage of inter-session speaker information, linked in the initial diarization pass, to achieve a 30% relative improvement over the original diarization error rate (DER) after only two iterations of rediarization.
机译:在本文中,我们提出了一种新颖的以迭代方式执行说话人区分的方案。我们旨在表明,通过说话人数字化处理的第一遍获得的信息可以重复使用,以细化和改善原始的数字化处理结果。我们称这种技术为扬声器重拨,并使用大量包含两个扬声器的通话记录来演示我们的重拨算法的实际应用。我们使用NIST 2008 SRE汇总的电话资料来评估我们的扬声器重拨系统。该语料库包含独立录音会话中反复出现的说话者身份,需要在整个语料库之间进行链接。我们证明了我们的说话人重化方案可以利用会话间的说话人信息(在初始二值化过程中链接),在经过两次重化后,相对于原始二值化错误率(DER)可获得30%的相对改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号