Exploring a Unified Attention-Based Pooling Framework for Speaker Verification

Abstract

The pooling layer is an essential component of neural-network-based speaker verification. Most current speaker verification networks use average pooling to derive utterance-level speaker representations. Average pooling treats every frame as equally important, which is suboptimal because speaker-discriminative power varies across speech segments. In this paper, we present a unified attention-based pooling framework and combine it with multi-head attention. Experiments on the Fisher and NIST SRE 2010 datasets show that using outputs from lower layers to compute the attention weights outperforms average pooling and achieves better results than the vanilla attention method. Multi-head attention further improves performance.
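
The contrast drawn in the abstract between average pooling and attention-based pooling can be illustrated with a minimal sketch. This is not the authors' implementation: the scoring function (a single tanh layer followed by a dot product), the parameter names W, b and v, and the choice to concatenate the outputs of the heads are all assumptions made for illustration. The `keys` argument stands in for frame-level outputs of a lower layer and `values` for the frames being pooled.

```python
import math
import random


def average_pooling(values):
    """Mean over frames: every frame implicitly gets weight 1/T."""
    num_frames = len(values)
    dim = len(values[0])
    return [sum(frame[d] for frame in values) / num_frames for d in range(dim)]


def softmax(scores):
    """Numerically stable softmax over a list of scalars."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(keys, W, b, v):
    """Score each frame as v . tanh(W k_t + b), then normalise over frames.

    W, b and v are hypothetical learned parameters (assumption)."""
    scores = []
    for key in keys:
        hidden = [
            math.tanh(sum(w * k for w, k in zip(row, key)) + bias)
            for row, bias in zip(W, b)
        ]
        scores.append(sum(vi * hi for vi, hi in zip(v, hidden)))
    return softmax(scores)


def attention_pooling(keys, values, W, b, v):
    """Weighted mean of `values`; the weights are computed from `keys`."""
    weights = attention_weights(keys, W, b, v)
    dim = len(values[0])
    return [sum(w * frame[d] for w, frame in zip(weights, values)) for d in range(dim)]


def multi_head_pooling(keys, values, heads):
    """One (W, b, v) triple per head; pooled vectors are concatenated
    (an assumed way of combining heads, not taken from the paper)."""
    pooled = []
    for W, b, v in heads:
        pooled.extend(attention_pooling(keys, values, W, b, v))
    return pooled


# Toy usage with random frames and random, untrained parameters.
random.seed(0)
T, key_dim, value_dim, hidden_dim, n_heads = 6, 4, 3, 5, 2
keys = [[random.gauss(0, 1) for _ in range(key_dim)] for _ in range(T)]
values = [[random.gauss(0, 1) for _ in range(value_dim)] for _ in range(T)]
heads = [
    (
        [[random.gauss(0, 1) for _ in range(key_dim)] for _ in range(hidden_dim)],
        [0.0] * hidden_dim,
        [random.gauss(0, 1) for _ in range(hidden_dim)],
    )
    for _ in range(n_heads)
]
embedding = multi_head_pooling(keys, values, heads)
print(len(embedding))  # n_heads * value_dim entries
```

When v is all zeros every frame receives the same score, so the attention weights become uniform and the result coincides with average pooling. In that sense average pooling is a special case of this kind of weighted pooling.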

Bibliographic details

Source
International Symposium on Chinese Spoken Language Processing, 2018, pp. 200-204 (5 pages)
Authors
Yi Liu; Liang He; Weiwei Liu; Jia Liu
Original format
PDF
Keywords
Feature extraction; Training; Neural networks; NIST; Computational modeling; Phonetics; Acoustics
