Histogram Equalization Using Centroids of Fuzzy C-Means of Background Speakers' Utterances for Speaker Identification


Abstract

In this paper, we propose a novel histogram equalization approach for speaker recognition with short utterances, which do not provide enough samples to build histograms. The proposed method clusters the features of randomly selected background speakers' utterances and estimates the cumulative distribution from the cluster centroids, sorted in ascending order, together with the samples of a short test utterance. Ranks are obtained from the test utterance and from the sorted centroid set, and the sum of the two ranks is used to estimate the cumulative distribution function. For the evaluation, we use the ETRI PC database and simulate VoIP codecs for the test set. The system is compared with other feature normalization methods such as CMN, MVN, and conventional HEQ. The proposed method reduces the error rates by 27.9%, 35.9%, and 30.1% in relative terms in the G.729, SILK, and Speex test environments, respectively.
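
The rank-summing step described in the abstract can be illustrated with a minimal sketch for a single feature dimension. The abstract does not give the exact formula, so several details here are assumptions made for illustration: the function name `equalize`, the CDF estimate `(rank_test + rank_bg - 0.5) / (N + K)`, and the standard normal reference distribution. The centroids are assumed to be precomputed (for example by fuzzy C-means on background speakers' features) and already sorted in ascending order.

```python
from bisect import bisect_right
from statistics import NormalDist


def equalize(test_samples, sorted_centroids):
    """Rank-based histogram equalization of one feature dimension.

    Assumptions (not taken from the paper): the CDF formula below and
    the standard normal target distribution.
    """
    sorted_test = sorted(test_samples)
    total = len(sorted_test) + len(sorted_centroids)
    target = NormalDist()  # assumed reference distribution
    equalized = []
    for x in test_samples:
        # Rank of x among the test utterance's own samples (always >= 1).
        rank_test = bisect_right(sorted_test, x)
        # Rank of x among the sorted background centroids (may be 0).
        rank_bg = bisect_right(sorted_centroids, x)
        # Sum of the two ranks gives a CDF estimate strictly inside (0, 1).
        cdf = (rank_test + rank_bg - 0.5) / total
        equalized.append(target.inv_cdf(cdf))
    return equalized


# Hypothetical values: five background centroids and a three-frame utterance.
centroids = [-2.0, -1.0, 0.0, 1.0, 2.0]
test = [0.3, -0.7, 1.5]
equalized = equalize(test, centroids)
```

Because the centroids contribute to every rank, the CDF estimate stays usable even when the test utterance has only a handful of frames, which is the situation the abstract targets. The mapping is monotonic, so the ordering of the test samples is preserved.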


Bibliographic details

Source
International Conference on Statistical Language and Speech Processing | 2013 | pp. 143-151 | 9 pages
Authors
Myung-Jae Kim; Il-Ho Yang; Ha-Jin Yu
Original format
PDF
Keywords
speaker recognition; speaker identification; histogram equalization


Similar literature


1. Histogram equalization using a reduced feature set of background speakers' utterances for speaker recognition [J] . Myung-jae Kim, Il-ho Yang, Min-seok Kim, Frontiers of Information Technology & Electronic Engineering . 2017, No. 5
2. Histogram equalization using a reduced feature set of background speakers' utterances for speaker recognition [J] . Myung-jae Kim, Il-ho Yang, Min-seok Kim, Journal of Zhejiang University-SCIENCE C (Computers & Electronics) . 2017, No. 5
3. Environmental robust speech and speaker recognition through multi-channel histogram equalization [J] . Stefano Squartini, Emanuele Principi, Rudy Rotili, Neurocomputing . 2012, No. 1
4. Histogram Equalization Using Centroids of Fuzzy C-Means of Background Speakers' Utterances for Speaker Identification [C] . Myung-Jae Kim, Il-Ho Yang, Ha-Jin Yu, International Conference on Statistical Language and Speech Processing . 2013
5. African American English Speakers' Production Demands in Spontaneous Utterances [D] . Mayanja, Seara. 2019
6. Speaker-external versus speaker-internal forces on utterance form: Do cognitive demands override threats to referential success? [O] . Liane Wardlow Lane, Victor S. Ferreira
7. Histogram Equalization Using Background Speakers' Utterances for Speaker Identification [O] . Myung-Jae Kim, Il-Ho Yang, Byung-Min So, 2012
8. Speaker Recognition from an Unknown Utterance and Speaker-Speech Interaction. [R] . Kashyap, R. L. 1976
