首页> 外文会议>International Conference on Digital Signal Processing >Improved gender/age recognition system using arousal-selection and feature-selection schemes
【24h】

Improved gender/age recognition system using arousal-selection and feature-selection schemes

机译:使用唤醒选择和特征选择方案的改进的性别/年龄识别系统

获取原文
获取外文期刊封面目录资料

摘要

This work proposes the arousal-selection and feature-selection schemes to improve speaker's gender and age identification performance. Our previous results showed that gender and age recognition rates would increase as affective stimulation degrees were lower and higher, respectively. Considering a practical scenario, the speaker's mood does not alter frequently, so speech frames are partitioned into two groups with low and high arousal levels. Here, two Gaussian Mixture Model (GMM) probability density functions are employed to characterize the distributions of the degrees of speech stimuli in terms of tone and energy variations. Such approach can appropriately classify speech frames and easily adapt to different speakers. As well as speech frames are fairly filtered and partitioned, the feature-selection scheme is effectively used to determine adequate low-level features. To do fair comparison, the experiment database adopts Lwazi corpus from South Africa. The proposed system using the arousal-selection and feature-selection schemes exhibits that accuracy rates of gender and age estimations reach 98.9% and 71.6% with 1.7% and 10.8% increases, respectively, as compared to the ones without using arousal-selection and feature-selection schemes. Therefore, the recognition system proposed herein successfully enhances accuracy rates of age and gender estimations for various human-machine interaction and multimedia applications.
机译:这项工作提出了唤醒选择和特征选择方案,以提高说话者的性别和年龄识别性能。我们以前的结果表明,随着情感刺激程度的降低和升高,性别和年龄的识别率将分别提高。考虑到实际情况,说话者的情绪不会经常改变,因此语音帧被分为具有低和高唤醒水平的两组。在此,采用两个高斯混合模型(GMM)概率密度函数来表征语音刺激程度在音调和能量变化方面的分布。这种方法可以适当地对语音帧进行分类,并轻松适应不同的说话者。不仅对语音帧进行了合理的过滤和划分,特征选择方案还可以有效地用于确定足够的低级特征。为了进行公平的比较,实验数据库采用了南非的Lwazi语料库。提出的使用唤醒选择和特征选择方案的系统显示,与不使用唤醒选择和特征的系统相比,性别和年龄估计的准确率分别达到98.9%和71.6%,分别提高了1.7%和10.8%。选择方案。因此,本文提出的识别系统成功地提高了用于各种人机交互和多媒体应用的年龄和性别估计的准确率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号