Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification

Abstract

This paper proposes a new approach to unsupervised speaker adaptation inspired by the recent success of the factor analysis-based Total Variability Approach to text-independent speaker verification [1]. This approach effectively represents speaker variability in terms of low-dimensional total factor vectors and, when paired with the simplicity of cosine similarity scoring, allows for easy manipulation and efficient computation [2]. The development of our adaptation algorithm is motivated by three goals: to have a robust method of setting an adaptation threshold, to minimize the amount of computation required for each adaptation update, and to simplify the associated score normalization procedures where possible. To address the final goal, we propose the Symmetric Normalization (S-norm) method, which takes advantage of the symmetry in cosine similarity scoring and achieves performance competitive with that of ZT-norm while requiring fewer parameter calculations. In subsequent experiments, we also assess an attempt to replace score normalization procedures altogether with a Normalized Cosine Similarity scoring function [3].

We evaluated the performance of our unsupervised speaker adaptation algorithm under various score normalization procedures on the 10sec-10sec and core conditions of the 2008 NIST SRE dataset. Using results without adaptation as our baseline, we found that the proposed methods consistently improve speaker verification performance and achieve state-of-the-art results.
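The abstract gives no formulas, so the following is only a minimal sketch of the two scoring ideas it names. It assumes total factor vectors are plain NumPy arrays, and it assumes one common form of symmetric normalization: the raw cosine score is z-normalized once against impostor scores of the target vector and once against impostor scores of the test vector, and the two results are summed. The function names `cosine_score` and `s_norm` and the impostor cohort matrix are hypothetical and are not taken from the paper.

```python
import numpy as np


def cosine_score(w_target, w_test):
    """Cosine similarity between two total factor vectors."""
    denom = np.linalg.norm(w_target) * np.linalg.norm(w_test)
    return float(np.dot(w_target, w_test) / denom)


def s_norm(w_target, w_test, impostors):
    """Symmetric score normalization (assumed form, for illustration only).

    impostors: 2-D array with one impostor total factor vector per row.
    """
    raw = cosine_score(w_target, w_test)

    # Length-normalize everything so that dot products are cosine scores.
    cohort = impostors / np.linalg.norm(impostors, axis=1, keepdims=True)
    t = w_target / np.linalg.norm(w_target)
    u = w_test / np.linalg.norm(w_test)

    # Impostor score distributions seen from each side of the trial.
    target_side = cohort @ t
    test_side = cohort @ u

    z_target = (raw - target_side.mean()) / target_side.std()
    z_test = (raw - test_side.mean()) / test_side.std()

    # The same statistics are used for both sides, so swapping the
    # target and test vectors leaves the normalized score unchanged.
    return float(z_target + z_test)
```

Under these assumptions only one impostor cohort is needed, and the same mean and standard deviation computation serves both the enrollment and the test side, which is one way to read the abstract's remark about requiring fewer parameter calculations than ZT-norm.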

Bibliographic record

Source
Odyssey 2010: The Speaker and Language Recognition Workshop | 2010 | pp. 84-90 | 7 pages
Conference location: Brno (CS)
Authors
Stephen Shum; Najim Dehak; Reda Dehak; James R. Glass
Author affiliations

MIT Computer Science and Artificial Intelligence Laboratory, 32 Vassar Street, Cambridge, MA 02139, USA

MIT Computer Science and Artificial Intelligence Laboratory, 32 Vassar Street, Cambridge, MA 02139, USA

Laboratoire de Recherche et de Développement de l'EPITA (LRDE), Paris, France

MIT Computer Science and Artificial Intelligence Laboratory, 32 Vassar Street, Cambridge, MA 02139, USA

Original format: PDF
Language: English
Chinese Library Classification: Speech signal processing

Similar literature

1. Non-speaker information reduction from Cosine Similarity Scoring in i-vector based speaker verification [J]. Zeinali Hossein, Mirian Alireza, Sameti Hossein. Computers and Electrical Engineering. 2015
2. Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification [J]. ZHAO Jian, DONG Yuan, ZHAO Xian-yu. The Journal of China Universities of Posts and Telecommunications (English Edition). 2008, No. 2
3. A Novel Scoring Method Based on Distance Calculation for Similarity Measurement in Text-Independent Speaker Verification [J]. Soufiane Hourri, Jamal Kharroubi. Procedia Computer Science. 2019, No. 11
4. Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation [C]. Wei Xia, Jing Huang, John H.L. Hansen. IEEE International Conference on Acoustics, Speech and Signal Processing. 2019
5. Speaker adaptation in joint factor analysis based text independent speaker verification [D]. Shou-Chun, Yin. 2007
6. The Unsupervised Feature Selection Algorithms Based on Standard Deviation and Cosine Similarity for Genomic Data Analysis [O]. Juanying Xie, Mingzhao Wang, Shengquan Xu. 2021
7. Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation [O]. Wei Xia, Jing Huang, John H.L. Hansen. 2019
8. Supervised and Unsupervised Speaker Adaptation in the NIST 2005 Speaker Recognition Evaluation [R]. Hansen, E. G., Slyh, R. E., Anderson, T. R. 2006
