首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization
【24h】

Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization

机译:比较跨阶段演讲者差异化的多阶段方法

获取原文

摘要

Acoustic speaker diarization is investigated for situations where a collection of shows from the same source needs to be processed. In this case, the same speaker should receive the same label across all shows. We compare different architectures for cross-show speaker diarization: the obvious concatenation of all shows, a hybrid system combining first a local clustering stage followed by a global clustering stage, and an incremental system which processes the shows in a predefined order and updates the speaker models accordingly. This latter system being best suited to real applicative situations. These three strategies were compared to a baseline single-show system on a set of 46 ten-minutes samples of British English scientific podcasts.
机译:针对需要处理来自同一来源的节目集合的情况,对声学扬声器的二元化进行了研究。在这种情况下,同一位发言人在所有演出中都应获得相同的标签。我们比较了跨节目演讲者差异化的不同体系结构:所有节目的明显串联;混合系统首先结合了本地聚类阶段,然后是全局聚类阶段;还有一个增量系统,该系统按预定义的顺序处理节目并更新说话者相应地进行建模。后一种系统最适合实际应用情况。在一组46个十分钟的英式英语科学播客样本中,将这三种策略与基线单播系统进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号