Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances

Sung-Hyun Yoon; Min-Sung Koh; Ha-Jin Yu

首页> 外文期刊>IAENG Internaitonal journal of computer science >Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances

【24h】

Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances

机译：基于模糊的限制Boltzmann Machine基于噪声强制文本依赖扬声器验证的概率线性判别分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the i-vector-based speaker verification system, it is important to compensate for session variability on the i-vector to improve speaker verification performance. Linear discriminant analysis (LDA) is widely used to compensate for session variability by reducing the dimensionality of the i-vector. Restricted Boltzmann machine (RBM)-based probabilistic linear discriminant analysis (PLDA) has been proposed to improve the session variability compensation ability of LDA. It can be viewed as a probabilistic approach of LDA using RBM. However, since the RBM does not consider uncertainties in obtaining the parameters, the representation capability of RBM-based PLDA is limited. For instance, many real-world speaker verifications must consider noisy environments, which make the compensated session variability uncertain. The fuzzy restricted Boltzmann machine (FRBM) was proposed to improve the capability of the RBM. It showed a more robust performance than that of the RBM. Hence, in this paper, we propose FRBM-based PLDA to improve the representation capability of RBM-PLDA by replacing all the parameters of RBM-PLDA with fuzzy numbers. An evaluation with Part 1 of Robust Speaker Recognition (RSR) 2015 was conducted. In the experimental results, the proposed algorithm shows a better compensation for phonetic variability that exists in short utterances, and a robust speaker verification performance in diverse noisy environments where phonetic and noise variabilities are challenging issues in real-world applications.

机译：在基于I形向量的扬声器验证系统中，重要的是要补偿I-vector上的会话变异，以提高扬声器验证性能。线性判别分析（LDA）广泛用于通过降低I形载体的维度来补偿会话变异性。已经提出了基于限制的Boltzmann机（RBM）基于概率线性判别分析（PLDA）以改善LDA的会话变异补偿能力。它可以被视为使用RBM的LDA的概率方法。然而，由于RBM在获得参数时不考虑不确定性，因此基于RBM的PLDA的表示能力是有限的。例如，许多现实世界的扬声器验证必须考虑嘈杂的环境，这使得补偿会话变异不确定。提出了模糊限制的Boltzmann机（FRBM）以提高RBM的能力。它显示出比RBM更强大的性能。因此，在本文中，我们提出了基于FRBM的PLDA，通过用模糊数取代RBM-PLDA的所有参数来改善RBM-PLDA的表示能力。进行了与强大的扬声器识别（RSR）2015的第1部分的评估。在实验结果中，所提出的算法显示出在短发声中存在的语音变异性的更好补偿，以及在不同噪声环境中的强大扬声器验证性能，其中语音和噪音变量是现实世界应用中的挑战性问题。

著录项

来源
《IAENG Internaitonal journal of computer science》 |2020年第2期|468-480|共13页
作者
Sung-Hyun Yoon; Min-Sung Koh; Ha-Jin Yu;
展开▼
作者单位

School of Computer Science University of Seoul Seoul 02504 Republic of Korea;

School of Computing and Engineering Sciences Eastern Washington University Cheney WA 99004 USA;

School of Computer Science University of Seoul Seoul 02504 Republic of Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Discriminant analysis; fuzzy restricted Boltzmann machine; i-vector; restricted Boltzmann machine; speaker verification;

机译：判别分析;模糊限制Boltzmann机;我矢量;受限制的Boltzmann机器;扬声器验证;

相似文献

外文文献
中文文献
专利

1. A fuzzy-clustering-based hierarchical i-vector/probabilistic inear discriminant analysis system for text-dependent speaker verification [J] . Laskar Mohammad Azharuddin, Laskar Rabul Hussain Expert Systems . 2020,第3期

机译：基于模糊聚类的分层I载体/概率INEAR判别分析分析系统，用于文本依赖扬声器验证
2. Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification [J] . Achintya Kumar Sarkar, Zheng-Hua Tan Computer speech and language . 2021,第Nova期

机译：文本依赖扬声器验证中深度特征学习的通行证话语的自我分割
3. Model selection and score normalization for text-dependent single utterance speaker verification [J] . OSMAN BüYüK, MUSTAFA LEVENT ARSLAN Turkish Journal of Electrical Engineering and Computer Sciences . 2012,第Supa2期

机译：模型选择和分数归一化，用于与文本相关的单个说话者说话人验证
4. Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification [C] . Yexin Yang, Shuai Wang, Man Sun, International Symposium on Chinese Spoken Language Processing . 2018

机译：基于生成对抗网络的X向量增强算法，用于说话人验证中的鲁棒概率线性判别分析
5. Efficient Machine Learning Inference for Embedded Systems with Integer Based Restricted Boltzmann Machines Classifiers [D] . Sosa Barillas, Bryan Samuel. 2019

机译：基于整数的受限Boltzmann机器分类器的嵌入式系统有效的机器学习推断
6. Short-time speaker verification with different speaking style utterances [O] . Hongwei Mao, Yan Shi, Yue Liu, 2020

机译：短时间发言者验证不同的说话风格的话语
7. TOWARDS NOISE-ROBUST SPEAKER RECOGNITION USING PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS [O] . Yun Lei, Lukas Burget, Luciana Ferrer, 2013

机译：利用概率线性判别分析实现噪声稳健的扬声器识别

Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances

摘要

著录项

相似文献

相关主题

期刊订阅