Minimum Classification Error Training for Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution

机译：基于多空间概率分布的高斯混合模型用于说话人识别的最小分类误差训练

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe an minimum classification error (MCE) training procedure for the MSD-GMM speaker models. MCE training is also applied to automatically estimate mixture-dependent stream weights for spectral and pitch streams. The MCE-based MSD-GMM speaker models are evaluated for a text-independent speaker identification task. Experimental results show that MCE training of the MSD-GMM parameters significantly reduces identification errors and system performance is further improved by appropriately weighting spectral and pitch streams using MCE training.

机译：在我们之前的工作中，我们提出了一种基于频谱和音高特征的说话人建模技术，用于基于多空间概率分布高斯混合模型（MSD-GMM）的与文本无关的说话人识别。我们已经提出了MSD-GMM参数的最大似然（ML）估计程序，并展示了其高识别性能。在本文中，我们描述了MSD-GMM扬声器模型的最小分类误差（MCE）训练过程。 MCE训练也适用于自动估计光谱流和沥青流的混合物相关流权重。对基于MCE的MSD-GMM扬声器模型进行评估，以执行与文本无关的扬声器识别任务。实验结果表明，通过MCE训练对频谱和音高流进行适当加权，可以对MSD-GMM参数进行MCE训练，显着减少了识别错误，并且进一步提高了系统性能。

著录项

来源
《European Conference on Speech Communication and Technology v.4; 20010903-20010907; Aalborg; DK》|2001年|P.2837-2840|共4页
会议地点 Aalborg(DK);Aalborg(DK)
作者
Chiyomi Miyajima; Keiichi Tokuda; Tadashi Kitamura;
展开▼
作者单位

Department of Computer Science, Nagoya Institute of Technology, Nagoya 466-8555, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类传播理论;
关键词

相似文献

外文文献
中文文献
专利

1. Text-Independent Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution [J] . Chiyomi Miyajima, Yosuke Hattori, Keiichi Tokuda IEICE Transactions on Information and Systems . 2001,第7期

机译：基于多空间概率分布的高斯混合模型与文本无关的说话人识别
2. Speaker Clustering Using Decision Tree-Based Phone Cluster Models With Multi-Space Probability Distributions [J] . Han-Ping Shen, Jui-Feng Yeh, Chung-Hsien Wu Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第5期

机译：使用基于决策树的具有多空间概率分布的电话聚类模型进行说话人聚类
3. Research on the error probability distribution of photovoltaic output prediction based on output fluctuation characteristics and Generalized Gaussian Mixture Model [J] . Peng Yan, Chenmeng Xiang, Wen Zhou, E3S Web of Conferences . 2021,第a期

机译：基于输出波动特性和广义高斯混合模型的光伏输出预测误差概率分布研究
4. Minimum Classification Error Training for Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution [C] . Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura European conference on speech communication and technology . 2001

机译：基于多空间概率分布的高斯混合模型的扬声器识别最低分类错误培训
5. A software based speaker identification system using Gaussian mixture model classification. [D] . Reynolds, Ryan M. 2005

机译：使用高斯混合模型分类的基于软件的说话人识别系统。
6. Two step Gaussian mixture model approach to characterize white matter disease based on distributional changes [O] . Namhee Kim, Moonseong Heo, Roman Fleysher, -1

机译：基于分布变化的两步高斯混合模型方法表征白质病
7. Speaker Identification Using Gaussian Mixture Models Based On Multi-Space Probability Distribution [O] . Chiyomi Miyajima, Yosuke Hattori, Keiichi Tokuda, 2001

机译：基于多空间概率分布的高斯混合模型说话人识别

Minimum Classification Error Training for Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅