首页> 外文会议>Annual conference of the International Speech Communication Association >Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization
【24h】

Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization

机译:卷积非负稀疏编码和说话人区分中语音重叠处理的新功能

获取原文

摘要

The effective handling of overlapping speech is at the limits of the current state of the art in speaker diarization. This paper presents our latest work in overlap detection. We report the combination of features derived through convolutive non-negative sparse coding and new energy, spectral and voicing-related features within a conventional HMM system. Overlap detection results are fully integrated into our top-down diarization system through the application of overlap exclusion and overlap labeling. Experiments on a subset of the AMI corpus show that the new system delivers significant reductions in missed speech and speaker error. Through overlap exclusion and labelling the overall diarization error rate is shown to improve by 6.4 % relative.
机译:重叠语音的有效处理处于说话者二值化的当前技术水平的极限。本文介绍了我们在重叠检测方面的最新工作。我们报告了通过传统的HMM系统中的卷积非负稀疏编码和新能源,频谱和语音相关特征得出的特征的组合。通过应用重叠排除和重叠标记,重叠检测结果已完全集成到我们的自上而下的数字化系统中。在AMI语料库的子集上进行的实验表明,新系统显着减少了语音遗漏和说话人错误。通过重叠排除和标记,总体偏差误差率相对提高了6.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号