
The speaker partitioning problem


Abstract

We unify several speaker recognition problems under the general speaker partitioning problem, in which a set of N inputs must be partitioned into subsets according to speaker. We show how to solve this problem with a simple generative model and demonstrate its performance on NIST SRE 2006 and 2008 data. Our solution yields probabilistic outputs, which we show how to evaluate with a cross-entropy criterion. Finally, we show that a discriminatively trained re-calibration transformation of the log-likelihoods improves the accuracy of the generative model.
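The partitioning formulation can be illustrated with a minimal sketch, which is not the authors' implementation. It enumerates every way to partition a small set of inputs, turns one log-likelihood per partition into a posterior distribution, and scores that distribution with the cross-entropy of the true partition. The function names and the log-likelihood values are hypothetical; in the paper the log-likelihoods would come from the generative model, which the abstract does not specify.

```python
import math

def set_partitions(items):
    """Yield every partition of `items` as a list of blocks (lists)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Place `first` into each existing block in turn...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ...or give it a new block of its own.
        yield [[first]] + partition

def posterior_over_partitions(log_likelihoods, log_priors=None):
    """Normalize per-partition log-likelihoods (plus optional log-priors)
    into posterior probabilities. A flat prior is assumed by default."""
    if log_priors is None:
        log_priors = [0.0] * len(log_likelihoods)
    scores = [ll + lp for ll, lp in zip(log_likelihoods, log_priors)]
    top = max(scores)  # subtract the maximum for numerical stability
    log_norm = top + math.log(sum(math.exp(s - top) for s in scores))
    return [math.exp(s - log_norm) for s in scores]

def cross_entropy(posterior, true_index):
    """Negative log-probability assigned to the true partition."""
    return -math.log(posterior[true_index])

# Three inputs (hypothetical recordings 0, 1, 2) give five candidate partitions.
partitions = list(set_partitions([0, 1, 2]))

# Made-up log-likelihoods, one per partition, for illustration only.
log_likelihoods = [-4.0, -1.0, -3.5, -3.0, -2.5]
posterior = posterior_over_partitions(log_likelihoods)

for partition, prob in zip(partitions, posterior):
    print(partition, round(prob, 3))
print("cross-entropy if partition 1 is true:", cross_entropy(posterior, 1))
```

With two inputs there are only two partitions, same speaker or different speakers, which has the shape of a familiar verification trial. The number of partitions grows very quickly with the number of inputs, so exhaustive enumeration like this is only practical for small sets.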
