首页> 外文会议>International Conference on speech and computer >Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection System
【24h】

Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection System

机译:基于CNN的说话者变化检测系统中的相位分​​析和标记策略

获取原文

摘要

In this paper we analyze different labeling strategies and their impact on speaker change detection rates. We explore binary, linear fuzzy, quadratic and Gaussian labeling functions. We come to the conclusion that the labeling function is very important and the linear variant outperforms the rest. We also add phase information from the spectrum to the input of our convolutional neural network. Experiments show that even though the phase is informative its benefit is negligible and may be omitted. In the experiments we use a coverage-purity measure which is independent on tolerance parameters.
机译:在本文中,我们分析了不同的标记策略及其对说话人变化检测率的影响。我们探索二进制,线性模糊,二次和高斯标记函数。我们得出的结论是,标记功能非常重要,线性变量优于其他变量。我们还将频谱中的相位信息添加到卷积神经网络的输入中。实验表明,即使该阶段提供了有益的信息,它的好处还是可以忽略不计的,因此可以省略。在实验中,我们使用了覆盖纯度测量方法,该方法与公差参数无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号