首页> 外文会议>International Workshop on Machine Learning for Multimodal Interaction >A Hybrid Generative-Discriminative Approach to Speaker Diarization
【24h】

A Hybrid Generative-Discriminative Approach to Speaker Diarization

机译:一种杂交生成鉴别的扬声器日益改复方法

获取原文

摘要

In this paper we present a sound probabilistic approach to speaker diarization. We use a hybrid framework where a distribution over the number of speakers at each point of a multimodal stream is estimated with a discriminative model. The output of this process is used as input in a generative model that can adapt to a novel test set and perform high accuracy speaker diarization. We manage to deal efficiently with the less common, and therefore harder, segments like silence and multiple speaker parts in a principled probabilistic manner.
机译:在本文中,我们提出了一种对扬声器日益改估的概率方法。我们使用混合框架,其中通过判别模型估计多模阶流的每个点处的扬声器数量的分布。该过程的输出用作生成模型中的输入,可以适应新型测试集并进行高精度扬声器日益率。我们设法以不太常见,更难的段,如沉默和多个扬声器零件的较小常见,以及以原则的概率方式处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号