首页> 外文会议>International conference on statistical language and speech processing >Histogram Equalization Using Centroids of Fuzzy C-Means of Background Speakers' Utterances for Speaker Identification
【24h】

Histogram Equalization Using Centroids of Fuzzy C-Means of Background Speakers' Utterances for Speaker Identification

机译:使用背景说话者说话人的模糊C均值质心进行直方图均衡以进行说话人识别

获取原文

摘要

In this paper, we propose a novel approach of histogram equalization for speaker recognition with short utterances which are not enough for building histograms. The proposed method clusters the features of randomly selected background speakers' utterances, and estimates the cumulative distribution using the centroids of the clusters sorted in ascending order and the samples of a short test utterance. The ranks are obtained from the test utterance and the sorted centroid set and the sum of the two ranks are used to estimate the cumulative distribution function. For the evaluation, we use ETRI PC database and simulate VoIP codecs for the test set. The system is compared with other feature normalization methods such as CMN, MVN and the conventional HEQ. Our proposed method reduces the error rates by 27.9%, 35.9%, and 30.1% relatively in the test environments: G.729, SILK and Speex, respectively.
机译:在本文中,我们提出了一种新颖的直方图均衡方法,用于说话人识别,但短发声不足以建立直方图。所提出的方法对随机选择的背景说话者说话的特征进行聚类,并使用以升序排序的聚类的质心和短测试说话的样本来估计累积分布。这些等级是从测试话语和排序的质心集获得的,两个等级的总和用于估计累积分布函数。为了进行评估,我们使用ETRI PC数据库并为测试集模拟VoIP编解码器。该系统与其他特征归一化方法(例如CMN,MVN和常规HEQ)进行了比较。我们提出的方法在测试环境中分别将G.729,SILK和Speex的错误率分别降低了27.9%,35.9%和30.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号