Conference: Spoken Language Technology Workshop

DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs

Abstract

Several advances have been made recently towards handling overlapping speech for speaker diarization. Since speech and natural language tasks often benefit from ensemble techniques, we propose an algorithm for combining outputs from such diarization systems through majority voting. Our method, DOVER-Lap, is inspired by the recently proposed DOVER algorithm, but is designed to handle overlapping segments in diarization outputs. We also modify the pair-wise incremental label mapping strategy used in DOVER, and propose an approximation algorithm based on weighted k-partite graph matching, which performs this mapping using a global cost tensor. We demonstrate the strength of our method by combining outputs from diverse systems (clustering-based, region proposal networks, and target-speaker voice activity detection) on AMI and LibriCSS datasets, where it consistently outperforms the single best system. Additionally, we show that DOVER-Lap can be used for late fusion in multichannel diarization, and that it compares favorably with early fusion methods like beamforming.
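
To make the voting step more concrete, here is a minimal Python sketch of overlap-aware weighted voting over frame-level hypotheses. It is an illustration under stated assumptions, not the authors' implementation: the function name `vote_frames`, the per-frame set representation, and the rule of keeping as many speakers as the systems report on average are all hypothetical choices that the abstract does not specify. The sketch also assumes the label mapping step has already been done, so all systems share one label space.

```python
from collections import defaultdict


def vote_frames(hypotheses, weights=None):
    """Combine frame-level diarization hypotheses by weighted voting.

    hypotheses: list of system outputs; each is a list with one set of
        active speaker labels per frame. Labels are assumed to be already
        mapped to a common label space (the mapping step is not shown).
    weights: optional per-system weights; defaults to equal weights.
    Returns a list with one set of active speaker labels per frame.
    """
    n_hyp = len(hypotheses)
    if weights is None:
        weights = [1.0] * n_hyp
    total = sum(weights)
    n_frames = len(hypotheses[0])
    if any(len(hyp) != n_frames for hyp in hypotheses):
        raise ValueError("all hypotheses must cover the same frames")

    combined = []
    for t in range(n_frames):
        votes = defaultdict(float)   # weighted votes per speaker label
        weighted_count = 0.0         # weighted sum of speaker counts
        for hyp, w in zip(hypotheses, weights):
            weighted_count += w * len(hyp[t])
            for spk in hyp[t]:
                votes[spk] += w
        # Assumed rule: keep as many speakers as the systems report on
        # average (rounded), which lets overlapping frames keep 2+ labels.
        n_keep = int(weighted_count / total + 0.5)
        # Rank by votes, breaking ties by label for determinism.
        ranked = sorted(votes, key=lambda s: (-votes[s], s))
        combined.append(set(ranked[:n_keep]))
    return combined


# Three toy systems over four frames; frame 1 contains overlapping speech.
sys_a = [{"A"}, {"A", "B"}, {"B"}, set()]
sys_b = [{"A"}, {"A", "B"}, {"A", "B"}, set()]
sys_c = [{"A"}, {"A"}, {"B"}, {"B"}]
combined = vote_frames([sys_a, sys_b, sys_c])
```

In this toy example, frame 1 keeps both speakers because two of the three systems report overlap there, while the lone extra label in frames 2 and 3 is voted out. Picking only the single top-voted speaker per frame would lose the overlap in frame 1.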