A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition

机译：基于非本地语音识别的子空间高斯混合模型的两阶段说话人自适应方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nonnative speech recognition is becoming more and more important as many speech applications are deployed world wide. Meanwhile, due to the large population of nonnative speakers, speaker adaptation remains the most practical way for providing high performance speech services. Subspace Gaussian Mixture Model (SGMM) has recently been shown to yield superior performance on various native speech recognition tasks. In this paper, we investigated different speaker adaptation techniques of SGMM for nonnative speech recognition. A two-stage direct model adaptation approach has been proposed based on the analysis of SGMM model parameter functionalities. Our initial experiments have also verified that the proposed approach is much more effective than the traditional feature-space Maximum Likelihood Linear Regression(MLLR) on SGMM based nonnative speaker adaptation tasks.

机译：随着世界范围内部署了许多语音应用程序，非本地语音识别变得越来越重要。同时，由于非母语使用者的人数众多，说话人适应仍然是提供高性能语音服务的最实用方法。子空间高斯混合模型（SGMM）最近在各种本地语音识别任务中表现出了卓越的性能。在本文中，我们研究了SGMM用于非本地语音识别的不同说话人自适应技术。在分析SGMM模型参数功能的基础上，提出了一种两阶段直接模型自适应方法。我们的初步实验还证明，在基于SGMM的非母语说话人自适应任务上，该方法比传统的特征空间最大似然线性回归（MLLR）更为有效。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|1770-1773|共4页
会议地点
作者
Bo LI; Khe Chai SIM;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaker Adaptation; Nonnative Speech Recognition; Subspace Gaussian Mixture Model;

机译：说话人适应;非母语语音识别;子空间高斯混合模型;

相似文献

外文文献
中文文献
专利

1. Fast model selection based speaker adaptation for nonnative speech [J] . Xiaodong He, Yunxin Zhao IEEE Transactions on Speech and Audio Proceessing . 2003,第4期

机译：基于快速模型选择的非母语语音说话人自适应
2. Regularized Subspace Gaussian Mixture Models for Speech Recognition [J] . Liang Lu, Ghoshal A., Renals S. Signal Processing Letters, IEEE . 2011,第7期

机译：用于语音识别的正则化子空间高斯混合模型
3. The subspace Gaussian mixture model-A structured model for speech recognition [J] . Daniel Povey, Lukas Burget, Mohit Agarwal, Computer speech and language . 2011,第2期

机译：子空间高斯混合模型-一种语音识别的结构化模型
4. A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition [C] . Bo LI, Khe Chai SIM INTERSPEECH 2012 . 2012

机译：基于子空间高斯混合模型的非舞蹈语音识别的两阶段扬声器适应方法
5. Model selection based speaker adaptation and its application to nonnative speech recognition. [D] . He, Xiaodong. 2003

机译：基于模型选择的说话人自适应及其在非本地语音识别中的应用。
6. Detecting Manic State of Bipolar Disorder Based on Support Vector Machine and Gaussian Mixture Model Using Spontaneous Speech [O] . Zhongde Pan, Chao Gui, Jing Zhang, 2018

机译：基于支持向量机和高斯混合模型的自发性语音躁狂状态检测
7. Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition [O] . Liang Lu, Arnab Ghoshal, Steve Renals 2012

机译：子空间高斯混合模型的最大后验自适应用于跨语言语音识别

A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅