首页> 美国政府科技报告 >Channel-Mismatch Compensation in Speaker Identification Feature Selection andAdaptation with Artificial Neural Networks
【24h】

Channel-Mismatch Compensation in Speaker Identification Feature Selection andAdaptation with Artificial Neural Networks

机译:基于人工神经网络的说话人识别特征选择与适应中的信道失配补偿

获取原文

摘要

We develop and present results of an artificial neural network (ANN) basedcompensation technique for mismatched classifier training and testing conditions in speaker identification (SID). One ANN per feature per speaker is trained to perform a mapping of that feature from a corrupted condition to an undistorted condition. Therefore, a classifier trained under one condition may be used to classify data collected under a different condition. Speech utterances from 168 speakers, collected in a studio, and also re-recorded after transmission over telephone networks, are used for developing and testing the method. Peak formant resonant frequencies, their bandwidths, and pitch are used as features. These features from the studio speech are used to train Gaussian Mixture Model classifiers. Portions of the studio and telephone speech are used to train the compensation ANNs. In mismatched train and test conditions, features from telephone speech are modified by the trained ANNs and applied to the GMMs trained with features from studio speech. Without compensation, SID accuracy is 6%. The compensation method developed in this work provides mismatch SID accuracy of 58.3%. Previous research on the same data with the commonly used Mel Frequency Cepstral Coefficients as features and a typical compensation method of Cepstral Mean Subtraction with Band Limiting gives SID accuracy of 27.4% with the same type of classifiers.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号