首页> 外文期刊>IAENG Internaitonal journal of computer science >Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances
【24h】

Fuzzy Restricted Boltzmann Machine based Probabilistic Linear Discriminant Analysis for Noise-Robust Text-Dependent Speaker Verification on Short Utterances

机译:基于模糊的限制Boltzmann Machine基于噪声强制文本依赖扬声器验证的概率线性判别分析

获取原文
获取原文并翻译 | 示例
           

摘要

In the i-vector-based speaker verification system, it is important to compensate for session variability on the i-vector to improve speaker verification performance. Linear discriminant analysis (LDA) is widely used to compensate for session variability by reducing the dimensionality of the i-vector. Restricted Boltzmann machine (RBM)-based probabilistic linear discriminant analysis (PLDA) has been proposed to improve the session variability compensation ability of LDA. It can be viewed as a probabilistic approach of LDA using RBM. However, since the RBM does not consider uncertainties in obtaining the parameters, the representation capability of RBM-based PLDA is limited. For instance, many real-world speaker verifications must consider noisy environments, which make the compensated session variability uncertain. The fuzzy restricted Boltzmann machine (FRBM) was proposed to improve the capability of the RBM. It showed a more robust performance than that of the RBM. Hence, in this paper, we propose FRBM-based PLDA to improve the representation capability of RBM-PLDA by replacing all the parameters of RBM-PLDA with fuzzy numbers. An evaluation with Part 1 of Robust Speaker Recognition (RSR) 2015 was conducted. In the experimental results, the proposed algorithm shows a better compensation for phonetic variability that exists in short utterances, and a robust speaker verification performance in diverse noisy environments where phonetic and noise variabilities are challenging issues in real-world applications.
机译:在基于I形向量的扬声器验证系统中,重要的是要补偿I-vector上的会话变异,以提高扬声器验证性能。线性判别分析(LDA)广泛用于通过降低I形载体的维度来补偿会话变异性。已经提出了基于限制的Boltzmann机(RBM)基于概率线性判别分析(PLDA)以改善LDA的会话变异补偿能力。它可以被视为使用RBM的LDA的概率方法。然而,由于RBM在获得参数时不考虑不确定性,因此基于RBM的PLDA的表示能力是有限的。例如,许多现实世界的扬声器验证必须考虑嘈杂的环境,这使得补偿会话变异不确定。提出了模糊限制的Boltzmann机(FRBM)以提高RBM的能力。它显示出比RBM更强大的性能。因此,在本文中,我们提出了基于FRBM的PLDA,通过用模糊数取代RBM-PLDA的所有参数来改善RBM-PLDA的表示能力。进行了与强大的扬声器识别(RSR)2015的第1部分的评估。在实验结果中,所提出的算法显示出在短发声中存在的语音变异性的更好补偿,以及在不同噪声环境中的强大扬声器验证性能,其中语音和噪音变量是现实世界应用中的挑战性问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号