首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Exploring a Unified Attention-Based Pooling Framework for Speaker Verification
【24h】

Exploring a Unified Attention-Based Pooling Framework for Speaker Verification

机译:探索用于说话人验证的基于注意力的统一池框架

获取原文

摘要

The pooling layer is an essential component in the neural network based speaker verification. Most of the current networks in speaker verification use average pooling to derive the utterance-level speaker representations. Average pooling takes every frame as equally important, which is suboptimal since the speaker-discriminant power is different between speech segments. In this paper, we present a unified attention-based pooling framework and combine it with the multi-head attention. Experiments on the Fisher and NIST SRE 2010 dataset show that involving outputs from lower layers to compute the attention weights can outperform average pooling and achieve better results than vanilla attention method. The multi-head attention further improves the performance.
机译:池化层是基于神经网络的说话者验证中的重要组成部分。说话人验证中当前的大多数网络都使用平均池来得出话语级别的说话人表示。平均池化将每个帧都视为同等重要,这是次优的,因为在语音段之间,区分说话者的能力是不同的。在本文中,我们提出了一个基于注意力的统一池框架,并将其与多头注意力结合在一起。在Fisher和NIST SRE 2010数据集上进行的实验表明,使用较低层的输出来计算注意力权重可以胜过平均池化,并且比香草注意力方法要好。多头注意力进一步提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号