首页> 外文会议>Annual conference of the International Speech Communication Association >A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition
【24h】

A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition

机译:基于非本地语音识别的子空间高斯混合模型的两阶段说话人自适应方法

获取原文

摘要

Nonnative speech recognition is becoming more and more important as many speech applications are deployed world wide. Meanwhile, due to the large population of nonnative speakers, speaker adaptation remains the most practical way for providing high performance speech services. Subspace Gaussian Mixture Model (SGMM) has recently been shown to yield superior performance on various native speech recognition tasks. In this paper, we investigated different speaker adaptation techniques of SGMM for nonnative speech recognition. A two-stage direct model adaptation approach has been proposed based on the analysis of SGMM model parameter functionalities. Our initial experiments have also verified that the proposed approach is much more effective than the traditional feature-space Maximum Likelihood Linear Regression(MLLR) on SGMM based nonnative speaker adaptation tasks.
机译:随着世界范围内部署了许多语音应用程序,非本地语音识别变得越来越重要。同时,由于非母语使用者的人数众多,说话人适应仍然是提供高性能语音服务的最实用方法。子空间高斯混合模型(SGMM)最近在各种本地语音识别任务中表现出了卓越的性能。在本文中,我们研究了SGMM用于非本地语音识别的不同说话人自适应技术。在分析SGMM模型参数功能的基础上,提出了一种两阶段直接模型自适应方法。我们的初步实验还证明,在基于SGMM的非母语说话人自适应任务上,该方法比传统的特征空间最大似然线性回归(MLLR)更为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号