首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Speaker diarization of broadcast streams using two-stage clustering based on i-vectors and cosine distance scoring
【24h】

Speaker diarization of broadcast streams using two-stage clustering based on i-vectors and cosine distance scoring

机译:基于I-vORS和余弦距离评分的双级聚类,扬声器日复速制

获取原文

摘要

In this paper we present our system for speaker diarization of broadcast news based on recent advances in the speaker recognition field. In the system, speaker segments determined by the speaker change-point detector are represented by i-vectors and similarity of segments' speakers evaluated using cosine distance scoring. Linear discriminant analysis is employed to cope with intra-speaker variability. The experiments were carried out using the COST278 multilingual broadcast news database. We demonstrate improvement of the performance over the baseline system based on the Bayesian Information Criterion (BIC) and highlight significant impact of cepstral mean normalization. Finally, two-stage clustering employing BIC-based clustering to pre-cluster segments in the first stage is examined and showed to yield further performance improvement. The best performing configuration of our system achieved 52.4% relative improvement of the speaker error rate over the baseline.
机译:本文基于扬声器识别领域的最近进步,我们介绍了我们的扬声器日益改复。在系统中,由扬声器变化点检测器确定的扬声器段由使用余弦距离评分评估的段扬声器的i-vector和相似性表示。采用线性判别分析来应对扬声器内变异性。使用成本278多语言广播新闻数据库进行实验。我们展示了基于贝叶斯信息标准(BIC)对基线系统的性能的改进,并突出了抗康斯兰均值的显着影响。最后,研究了在第一阶段中使用基于BIC基础聚类的两级聚类,并显示出进一步的性能改善。我们系统的最佳表现配置实现了52.4%的相对改善了基线的扬声器错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号