Domain compensation based on phonetically discriminative features for speaker verification

Yanhua Long; Hong Ye; Jifeng Ni

首页> 外文期刊>Computer speech and language >Domain compensation based on phonetically discriminative features for speaker verification

【24h】

Domain compensation based on phonetically discriminative features for speaker verification

机译：基于语音区分功能的域补偿，用于说话人验证

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a new domain compensation framework by using phonetically discriminative features which are extracted from domain-dependent deep neural networks (DNNs). The domain compensation can be applied in both unsupervised and supervised manner, depending on whether the domain information of the development data is provided or not in advance. In supervised manner, the DNNs are trained on the development speech recordings of each given domain separately. While in the unsupervised manner, the development datasets are first automatically clustered into different domains, by using the Gaussian Mixture Model mean supervectors which are generated from each of the speech recordings, DNNs are then trained on the resulting clusters. Finally, we compensate the domain variabilities during the target speaker modeling step using support vector machines, by feeding in statistical vectors which are derived from the discriminative features extracted from the domain-dependent DNNs. The main strength of our proposed framework is that it does not need any speaker labels in the development dataset, which makes the proposed framework of great advantage over the state-of-the-art techniques that need speaker labels to train inter-speaker and/or intra-speaker variability models or channel compensation. Three speaker verification systems are investigated to examine the effectiveness of this new framework. Experimental results on the NIST SRE 2010 task demonstrate competitive performances to the state-of-the-art techniques in an initial implementation of the proposed framework.

机译：本文使用从领域相关的深度神经网络（DNN）中提取的语音区分特征，提出了一种新的领域补偿框架。根据是否预先提供了开发数据的域信息，可以以无监督和有监督两种方式应用域补偿。以监督的方式，分别在每个给定域的发展语音记录上对DNN进行训练。在无人监管的情况下，首先通过使用高斯混合模型的平均超向量自动将开发数据集聚到不同的域中，这些平均超向量是从每个语音记录中生成的，然后在所得的聚类上训练DNN。最后，我们使用支持向量机，通过输入统计向量来补偿目标说话人建模步骤中的域变异性，这些统计向量是从从依赖于域的DNN中提取的区分特征中得出的。我们提出的框架的主要优势在于，它在开发数据集中不需要任何说话者标签，这使得该提议的框架相对于需要说话者标签来训练演讲者和/或其他人的最新技术具有很大的优势。或扬声器内可变性模型或通道补偿。研究了三种说话人验证系统，以检查此新框架的有效性。 NIST SRE 2010任务的实验结果证明了在最初实施建议的框架中具有与最新技术相竞争的性能。

著录项

来源
《Computer speech and language》 |2017年第1期|161-179|共19页
作者
Yanhua Long; Hong Ye; Jifeng Ni;
展开▼
作者单位

Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai, 200234, China;

Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai, 200234, China;

Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai, 200234, China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Discriminative features; Ⅰ-vector; Domain compensation; Deep neural network; Speaker verification;

机译：区别特征;Ⅰ-载体;域补偿;深度神经网络说话者验证;

相似文献

外文文献
中文文献
专利

1. Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification [J] . Sarkar A.K., Do C.-T., Le V.-B., IEEE signal processing letters . 2014,第9期

机译：倒谱和语音辨别功能的组合，用于说话人验证
2. Generalized I-vector Representation with Phonetic Tokenizations and Tandem Features for both Text Independent and Text Dependent Speaker Verification [J] . Li Ming, Liu Lun, Cai Weicheng, Journal of signal processing systems for signal, image, and video technology . 2016,第2期

机译：具有语音分词和串联特性的通用I向量表示，可用于文本无关和文本相关的说话人验证
3. Discriminative likelihood score weighting based on acoustic-phonetic classification for speaker identification [J] . Youngjoo Suh, Hoirin Kim EURASIP journal on advances in signal processing . 2014,第1期

机译：基于语音分类的判别似然评分加权用于说话人识别
4. Speaker Recognition Via Nonlinear Phonetic-and Speaker-Discriminative Features [C] . Lara Stoll, Joe Frankel, Nikki Mirghafori International Conference on Nonlinear Speech Processing . 2008

机译：扬声器识别通过非线性语音和扬声器 - 辨别特征
5. Deep Neural Network Based Speaker Verification Under Domain Mismatched Conditions [D] . Zhang, Chunlei. 2019

机译：基于深度神经网络的扬声器验证在域不匹配条件下
6. Efficient Invariant Features for Sensor Variability Compensation in Speaker Recognition [O] . Abdennour Alimohad, *, Ahmed Bouridane, 2014

机译：说话人识别中传感器可变性补偿的高效不变性
7. Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification [O] . 2014

机译：患者验证的抗康诵和语音辨别特征的组合
8. Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification. [R] . Graciarena, M., Bocklet, T., Shriberg, E., 2013

机译：基于特征和基于通道的说话人验证中内在变异性分析。

Domain compensation based on phonetically discriminative features for speaker verification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅