首页> 外文会议>2018 IEEE Spoken Language Technology Workshop >Attention Mechanism in Speaker Recognition: What Does it Learn in Deep Speaker Embedding?
【24h】

Attention Mechanism in Speaker Recognition: What Does it Learn in Deep Speaker Embedding?

机译:说话人识别中的注意力机制:在深度说话人嵌入中学到什么?

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an experimental study on deep speaker embedding with an attention mechanism that has been found to be a powerful representation learning technique in speaker recognition. In this framework, an attention model works as a frame selector that computes an attention weight for each frame-level feature vector, in accord with which an utterance-level representation is produced at the pooling layer in a speaker embedding network. In general, an attention model is trained together with the speaker embedding network on a single objective function, and thus those two components are tightly bound to one another. In this paper, we consider the possibility that the attention model might be decoupled from its parent network and assist other speaker embedding networks and even conventional i-vector extractors. This possibility is demonstrated through a series of experiments on a NIST Speaker Recognition Evaluation (SRE) task, with 9.0% EER reduction and 3.8% minCprimary reduction when the attention weights are applied to i-vector extraction. Another experiment shows that DNN-based soft voice activity detection (VAD) can be effectively combined with the attention mechanism to yield further reduction of minCprimary by 6.6% and 1.6% in deep speaker embedding and i-vector systems, respectively.
机译:本文提出了一种基于深层说话人嵌入的实验研究,该机制具有注意力机制,已被发现是说话人识别中一种强大的表示学习技术。在此框架中,注意力模型用作帧选择器,为每个帧级特征向量计算注意力权重,据此,说话人嵌入网络中的汇聚层将产生话语级表示。通常,注意力模型与说话人嵌入网络一起在单个目标函数上进行训练,因此,这两个组件彼此紧密地绑定在一起。在本文中,我们考虑了注意力模型可能与其父网络分离的可能性,并可以帮助其他说话者嵌入网络,甚至传统的i-vector提取器。通过NIST说话者识别评估(SRE)任务的一系列实验证明了这种可能性,EER降低9.0 \\%,minC降低3.8 \\%主要 \ n减少注意权重应用于i-vector萃取。另一个实验表明,基于DNN的软语音活动检测(VAD)可以与注意力机制有效结合,从而进一步降低minC \ n 主要的 \ n在深层发言人嵌入和i中分别降低了6.6 \\%和1.6 \\% -vector系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号