首页> 外文会议>Annual conference of the International Speech Communication Association >I-vectors and ILP clustering adapted to cross-show speaker diarization
【24h】

I-vectors and ILP clustering adapted to cross-show speaker diarization

机译:I矢量和ILP聚类适用于跨场演说者差异化

获取原文

摘要

We propose to study speaker diarization from a collection of audio documents. The goal is to detect speakers appearing in several shows. In our approach, each show of the collection is processed separately before being processed collectively, to group speakers involved in several shows. Two clustering methods are studied for the overall processing of the collection: one uses the NCLR metric and the other is inspired by techniques based on i-vectors, mainly used in the speaker verification field. Both methods were evaluated on the whole training corpus of ESTER 2. The method based on the use of i-vectors achieves error rates similar to those obtained by the NCLR method, however, the computation time is on average 8.66 times faster. Therefore, this method is suitable for processing large volumes of data.
机译:我们建议从音频文档的集合中研究说话者的歧义化。目的是检测出现在几场演出中的说话者。在我们的方法中,收藏集的每个节目在进行集体处理之前都将被分别处理,以将参与多个节目的演讲者分组。研究了两种用于集合总体处理的聚类方法:一种使用NCLR度量,另一种则受到基于i矢量的技术的启发,主要用于说话人验证领域。两种方法都在ESTER 2的整个训练语料库上进行了评估。基于i向量的方法的错误率与NCLR方法相似,但是计算时间平均快了8.66倍。因此,此方法适用于处理大量数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号