Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method

Tatsuya ITO; Kei HASHIMOTO; Yoshihiko NANKAKU; Akinobu LEE; Keiichi TOKUDA

首页> 外文期刊>電子情報通信学会技術研究報告 >Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method

【24h】

Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method

机译：基于高斯混合模型的变分贝叶斯方法说话人识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a speaker identification system based on Gaussian Mixture Models (GMM) using the variational Bayesian method. Maximum Likelihood (ML) and Maximum A Posterior (MAP) are well-known methods for estimating GMM parameters. However, the overtraining problem occurs with insufficient data due to a point estimate of model parameters. The Bayesian approach estimates a posterior distribution of model param eters and achieves a robust prediction. To solve complicated integral calculations in the Bayesian approach, the variational Bayesian method has been proposed. This paper investigates the performance of the Bayesian approach in large speaker identification tasks.%ガウス混合モデル（Gaussian Mixture Model；GMM）に基づく話者認識は，これまで尤度最大化（Maximum Likelihood；ML）基準や事後確率最大化（Maximum a Posterior；MAP）基準が用いられてきた．しかし，ML基準は．モデルパラメータを確定的変数として点推定するため，学習データが十分に得られない場合，モデルの推定精度が低化する可能性がある．この問題に対し．近年，変分ベイズ法が提案され，様々なモデルにべイズ学習が適用可能となった.本研究では，GMMの学習に変分ベイズ法を適用し，ベイズ基準による話者認識の有効性について検討する．

机译：本文提出了一种基于Gaussian混合模型（GMM）的说话人识别系统，采用了变分贝叶斯方法，最大似然（ML）和最大后验（MAP）是估计GMM参数的著名方法，但是，过度训练会出现问题。贝叶斯方法估计了模型参数的后验分布并实现了稳健的预测。贝叶斯方法为解决复杂的积分计算问题，提出了变分贝叶斯方法。迄今为止，已经使用最大似然（ML）标准和后验概率最大化（Maximum a）进行了基于高斯混合模型（GMM）的说话人识别。已使用后验（MAP）标准。但是，ML标准是。由于模型参数被估计为确定性变量，因此如果没有足够的训练数据，则模型的估计精度可能会降低。对于这个问题。近年来，提出了变分贝叶斯方法，并将贝叶斯学习方法应用到各种模型中，本文将变分贝叶斯方法应用于GMM学习中，证明了基于贝叶斯准则的说话人识别的有效性。考虑。

著录项

来源
《電子情報通信学会技術研究報告》 |2008年第338期|p.185-190|共6页
作者
Tatsuya ITO; Kei HASHIMOTO; Yoshihiko NANKAKU; Akinobu LEE; Keiichi TOKUDA;
展开▼
作者单位

Department of Computer Science and Engineering, Nagoya Institute of Technology Gokiso-cho,Showa-ku, Nagoya, 466-8555 Japan;

Department of Computer Science and Engineering, Nagoya Institute of Technology Gokiso-cho,Showa-ku, Nagoya, 466-8555 Japan;

Department of Computer Science and Engineering, Nagoya Institute of Technology Gokiso-cho,Showa-ku, Nagoya, 466-8555 Japan;

Department of Computer Science and Engineering, Nagoya Institute of Technology Gokiso-cho,Showa-ku, Nagoya, 466-8555 Japan;

Department of Computer Science and Engineering, Nagoya Institute of Technology Gokiso-cho,Showa-ku, Nagoya, 466-8555 Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
speaker recognition; GMM; variational bayesian method;

机译：说话人识别;GMM;变分贝叶斯方法;

相似文献

外文文献
中文文献
专利

1. Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method [J] . Tatsuya ITO, Kei HASHIMOTO, Yoshihiko NANKAKU, 電子情報通信学会技術研究報告. 音声. Speech . 2008,第338期

机译：基于高斯混合模型的变分贝叶斯方法说话人识别
2. Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method [J] . Tatsuya ITO, Kei HASHIMOTO, Yoshihiko NANKAKU, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2008,第337期

机译：基于高斯混合模型的变分贝叶斯方法说话人识别
3. Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method [J] . Tatsuya ITO, Kei HASHIMOTO, Yoshihiko NANKAKU, 電子情報通信学会技術研究報告 . 2008,第337期

机译：基于高斯混合模型的变分贝叶斯方法说话人识别
4. Comparison of Speaker Segmentation Methods Based on the Bayesian Information Criterion and Adapted Gaussian Mixture Models [C] . Matej Grasic, Marko Kos, Andrej Zgank, International Conference on Systems, Signals and Image Processing . 2008

机译：基于贝叶斯信息标准和适应高斯混合模型的扬声器分割方法的比较
5. Using Gaussian mixture model and partial least squares regression classifiers for robust speaker verification with various enhancement methods [D] . Edwards, Joshua Scott. 2017

机译：使用Gaussian混合模型和偏最小二乘回归分类，用于具有各种增强方法的强大扬声器验证
6. A Method for Broccoli Seedling Recognition in Natural Environment Based on Binocular Stereo Vision and Gaussian Mixture Model [O] . Luzhen Ge, Zhilun Yang, Zhe Sun, 2019

机译：基于双目立体视觉和高斯混合模型的自然环境中西兰花苗种识别方法
7. A GAUSSIAN MIXTURE MODEL-BASED SPEAKER RECOGNITION SYSTEM [O] . Kumari Piu Gorai, Thomas Abraham 2017

机译：基于高斯混合模型的扬声器识别系统

Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method

摘要

著录项

相似文献

相关主题

期刊订阅