首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Latent variable speaker adaptation of Gaussian mixture weights and means
【24h】

Latent variable speaker adaptation of Gaussian mixture weights and means

机译:潜在变量说话人对高斯混合权重和均值的适应

获取原文

摘要

We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition systems, which adapts both the Gaussian means and the mixture weights. Gaussian means are expressed as a linear combination of eigenvoices estimated with principal component analysis. The non-negative Gaussian mixture weights are expressed as a linear combination of a set of latent vectors estimated with non-negative matrix factorization. Experiments on the Wall Street Journal database show that the combination of weight and mean adaptation consistently improves the performance compared to eigenvoice adaptation only. Improvements up to 5.8% relative word error rate reduction were observed with 40 eigenvoices and 40 latent weight vectors. Furthermore, combining weight and mean adaptation outperformed both weight and mean adaptation on itself, even if the latter uses more latent vectors.
机译:我们描述了一种适用于大型词汇语音识别系统的新颖的快速说话人自适应算法,该算法可以同时适应高斯方法和混合权重。高斯均值表示为通过主成分分析估算的特征语音的线性组合。非负高斯混合权重表示为通过非负矩阵分解估计的一组潜在向量的线性组合。 《华尔街日报》数据库的实验表明,权重和均值自适应的组合与仅本征语音自适应相比,始终可以提高性能。使用40个特征语音和40个潜在权重矢量可以观察到相对词错误率降低高达5.8%的改进。此外,即使权重和均值自适应使用更多的潜在向量,将权重和均值自适应结合起来也优于权重和均值自适应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号