首页> 外文会议>Odyssey 2010: the speaker and language recognition workshop >Joint Factor Analysis for Speaker Recognition reinterpreted as Signal Coding using Overcomplete Dictionaries
【24h】

Joint Factor Analysis for Speaker Recognition reinterpreted as Signal Coding using Overcomplete Dictionaries

机译:使用超完备字典将说话人识别的联合因素分析重新解释为信号编码

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a reinterpretation of Joint Factor Analysis as a signal approximation methodology—based on ridge regression—using an overcomplete dictionary learned from data. A non-probabilistic perspective of the three fundamental steps in the JFA paradigm based on point estimates is provided. That is, model training, hyperparameter estimation and scoring stages are equated to signal coding, dictionary learning and similarity computation respectively. Establishing a connection between these two well-researched areas opens the doors for cross-pollination between both fields. As an example of this, we propose two novel ideas that arise naturally form the non-probabilistic perspective and result in faster hyperparameter estimation and improved scoring. Specifically, the proposed technique for hyperparameter estimation avoids the need to use explicit matrix inversions in the M-step of the ML estimation. This allows the use of faster techniques such as Gauss-Seidel or Cholesky factorizations for the computation of the posterior means of the factors x, y and z during the E-step. Regarding the scoring, a similarity measure based on a normalized inner product is proposed and shown to outperform the state-of-the-art linear scoring approach commonly used in JFA. Experimental validation of these two novel techniques is presented using closed-set identification and speaker verification experiments over the Switchboard database.
机译:本文提出了联合因子分析的重新解释,它是基于岭回归的信号近似方法,它使用了从数据中学到的过完整的字典。提供了基于点估计的JFA范例中三个基本步骤的非概率性观点。也就是说,模型训练,超参数估计和评分阶段分别等于信号编码,字典学习和相似度计算。在这两个经过深入研究的领域之间建立联系为这两个领域之间的异花授粉打开了大门。以此为例,我们提出了两种新颖的想法,它们自然地从非概率的角度出现,并导致更快的超参数估计和改进的评分。特别地,所提出的用于超参数估计的技术避免了在ML估计的M步中使用显式矩阵求逆的需要。这允许使用诸如Gauss-Seidel或Cholesky分解等更快的技术来计算E步中因子x,y和z的后均值。关于计分,提出了一种基于归一化内积的相似性度量,该度量显示出优于JFA中常用的最新线性计分方法。这两种新颖技术的实验验证是通过在Switchboard数据库上进行的封闭式识别和说话人验证实验来进行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号