首页> 外文会议>IEEE Automatic Speech Recognition and Understanding Workshop >Incorporating Prior Knowledge into Speaker Diarization and Linking for Identifying Common Speaker
【24h】

Incorporating Prior Knowledge into Speaker Diarization and Linking for Identifying Common Speaker

机译:将先验知识纳入说话人区分和链接以识别普通说话人

获取原文

摘要

Speaker Diarization and Linking discovers “who spoke when” across recordings without any speaker enrollment. Diarization is performed on each recording separately, and the linking combines clusters of the same speaker across recordings. It is a two-step approach, however it suffers from propagating the error from diarization step to the linking step. In a situation where a unique speaker appears in a given set of recordings, this paper aims at locating the common speaker using the prior knowledge of his or her existence. That means there is no enrollment data for this common speaker. We propose Pairwise Common Speaker Identification (PCSI) method that takes the existence of a common speaker into account in contrast to the two-step approach. We further show that PCSI can be used to reduce the errors that are introduced in the diarization step of the two-step approach. Our experiments are performed on a corpus synthesised from the AMI corpus and also on a in-house conversational telephony Sichuanese corpus that is mixed with Mandarin. We show up to 7.68% relative improvements of time-weighted equal error rate over a state-of-art x-vector diarization and linking system.
机译:扬声器日益增估和连接“在没有任何发言者入学的记录中谈论”谁说话。在每次记录上进行日复速度,并且连接在录制中结合了同一扬声器的簇。这是一种两步的方法,但它遭受从日复速度步骤到连接步骤的误差。在一个独特的扬声器出现在一套特定的录音中的情况下,本文旨在使用他或她的存在的先验知识定位公共扬声器。这意味着这个公共扬声器没有注册数据。我们提出了成对的通用扬声器识别(PCSI)方法,其与两步方法相比,考虑了共同扬声器的存在。我们进一步表明,PCSI可用于减少两步方法的日复速度步骤中引入的错误。我们的实验是对从AMI语料库合成的语料库中进行的,也是在内部的谈话电话中,季度语料库与普通话混合。在最先进的X-载体日期和连接系统上显示出高达7.68%的相对改善时间加权等误码率的相对改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号