首页> 外文期刊>Digital Signal Processing >Approaches to speaker detection and tracking in conversational speech
【24h】

Approaches to speaker detection and tracking in conversational speech

机译:会话语音中说话人检测和跟踪的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partition the speech file into speaker homogenous regions and then to create scores for these regions. We refer to this approach as internal segmentation. Another approach uses an external segmentation algorithm, based on blind clustering, to partition the speech file into speaker homogenous regions. The adapted GMM-UBM system then scores each of these regions as in the single-speaker recognition case. We show that the external segmentation system outperforms the internal segmentation system for both detection and tracking. In addition, we show how different components of the detection and tracking algorithms contribute to the overall system performance. (C) 2000 Academic Press. [References: 15]
机译:描述了检测和跟踪多扬声器音频中的扬声器的两种方法。两种方法都使用自适应的高斯混合模型,通用背景模型(GMM-UBM)说话者检测系统作为核心说话者识别引擎。在一种方法中,由GMM-UBM系统逐帧生成的单个对数似然比得分用于首先将语音文件划分为说话者同质区域,然后为这些区域创建得分。我们将此方法称为内部细分。另一种方法是使用基于盲聚类的外部分段算法,将语音文件划分为多个说话者同质区域。然后,如在单讲话者识别情况下,适应的GMM-UBM系统对这些区域中的每一个进行评分。我们表明,外部分割系统在检测和跟踪方面都优于内部分割系统。此外,我们展示了检测和跟踪算法的不同组件如何对整体系统性能做出贡献。 (C)2000学术出版社。 [参考:15]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号