首页> 外文会议>International Conference on Signal Processing(ICSP'06); 20061116-20; Guilin(CN) >Speaker age interval and sex identification based on Jitters, Shimmers and Mean MFCC using supervised and unsupervised discriminative classification methods
【24h】

Speaker age interval and sex identification based on Jitters, Shimmers and Mean MFCC using supervised and unsupervised discriminative classification methods

机译:基于抖动,微光和均值MFCC的说话者年龄间隔和性别识别,采用有监督和无监督的判别分类方法

获取原文
获取原文并翻译 | 示例

摘要

Discrimination ability of Speech long term features, including Jitters, Shimmers and Mean MFCC is proposed, for age interval and sex identification. First to make a primary study of discrimination ability, two well-known unsupervised classification methods, i.e. K-Means and FCM, were used. Then, two supervised discriminative classification approaches, namely MLP neural network and SVM, have been employed for more precise age interval and sex identification, tn addition, in order to make a study of mutual influences of age interval and sex discriminative features, a cascade combination of two MLPs neural networks, with one trained for age interval and other one for sex identification, has been utilized separately. Most practical applications of age interval and sex identification are remote applications where usually speech signal is affected by telecommunication channels. To take this affect into consideration, a telephonic database has been used in experiments. Obtained results demonstrate that Jitter and Shimmer have good discrimination ability between male and female or young and old speakers, but do not discriminate small age intervals appropriately. On the other hand, Mean MFCC is not suitable for sex unsupervised classification but leads to an increase in sex supervised classification performance. Also these coefficients contain useful information about speaker age interval, and can result in a decrease in identification error rate.
机译:提出了语音长期特征的识别能力,包括抖动,微光和均值MFCC,用于年龄间隔和性别识别。首先对歧视能力进行初步研究,使用了两种众所周知的无监督分类方法,即K-Means和FCM。然后,采用了两种有监督的判别分类方法,即MLP神经网络和SVM,用于更精确的年龄区间和性别识别,此外,为了研究年龄区间和性别判别特征的相互影响,采用级联组合分别使用了两个MLP神经网络中的一个,其中一个训练了年龄间隔,另一个训练了性别识别。年龄间隔和性别识别的大多数实际应用是远程应用,通常语音信号会受到电信信道的影响。为了考虑这种影响,在实验中使用了电话数据库。所得结果表明,抖动和闪光对男性和女性或年轻和年长的说话者具有良好的辨别能力,但不能适当地区分较小的年龄段。另一方面,Mean MFCC不适用于性别无监督分类,但会导致性别有监督分类性能提高。这些系数还包含有关说话者年龄间隔的有用信息,并且可能导致识别错误率降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号