首页> 外文会议>Annual Conference of the International Speech Communication Association >Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge
【24h】

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge

机译:在野生(SITW)扬声器识别挑战中调查扬声器的各种日复一衰算法

获取原文

摘要

Collecting training data for real-world text-independent speaker recognition is challenging. In practice, utterances for a specific speaker are often mixed with many other acoustic signals. To guarantee the recognition performance, the segments spoken by target speakers should be precisely picked out. An automatic detection could be developed to reduce the cost of expensive human hand-made annotations. One way to achieve this goal is by using speaker diarization as a pre-processing step in the speaker enrollment phase. To this end, three speaker diarization algorithms based on Bayesian information criterion (BIC), agglomerative information bottleneck (aIB) and i-vector are investigated in this paper. The corresponding impacts on the results of speaker recognition system are also studied. Experiments conducted on Speaker in the Wild (SITW) Speaker Recognition Challenge (SRC) 2016 showed that the utilization of a proper speaker diarization improves the overall performance. Some more efforts are made to combine these methods together as well.
机译:收集现实世界文本独立扬声器识别的培训数据是具有挑战性的。在实践中,特定扬声器的话语通常与许多其他声学信号混合。为了保证识别性能,应准确地挑选目标发言者所说的细分。可以开发自动检测以降低昂贵的人类手工制作注释的成本。实现这一目标的一种方法是通过使用扬声器日益改估作为扬声器注册阶段的预处理步骤。为此,本文研究了基于贝叶斯信息标准(BIC),附聚信息瓶颈(AIB)和I形载体的三个扬声器深度算法。还研究了对扬声器识别系统结果的相应影响。在野外(SITW)扬声器识别挑战(SRC)2016上对扬声器进行的实验表明,利用适当的扬声器深度提高了整体性能。还有一些努力也将这些方法组合在一起。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号