首页> 外文会议>INTERSPEECH 2012 >I-vectors and ILP clustering adapted to cross-show speaker diarization
【24h】

I-vectors and ILP clustering adapted to cross-show speaker diarization

机译:i-vectors和ILP聚类适用于跨展示扬声器日益改估

获取原文

摘要

We propose to study speaker diarization from a collection of audio documents. The goal is to detect speakers appearing in several shows. In our approach, each show of the collection is processed separately before being processed collectively, to group speakers involved in several shows. Two clustering methods are studied for the overall processing of the collection: one uses the NCLR metric and the other is inspired by techniques based on i-vectors, mainly used in the speaker verification field. Both methods were evaluated on the whole training corpus of ESTER 2. The method based on,the use of i-vectors achieves error rates similar to those obtained by the NCLR method, however, the computation time is on average 8.66 times faster. Therefore, this method is suitable for processing large volumes of data.
机译:我们建议从一系列音频文件学习扬声器日益改血。目标是检测几个节目中出现的扬声器。在我们的方法中,在共同处理之前,每个集合的每个节目都是单独处理的,对涉及几个节目的小组扬声器。研究了两个聚类方法,用于集合的整体处理:一个使用NCLR度量,另一个由基于I-Viptors的技术启发,主要用于扬声器验证领域。在酯类的整个训练语料库中评估了两种方法。该方法基于,使用I-vOcs的使用与NCLR方法获得的误差率类似,但是,计算时间平均速度更快8.66倍。因此,该方法适用于处理大量数据。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号