首页> 外文会议>International Conference on Bio-inspired Systems and Signal Processing >DNN-based Models for Speaker Age and Gender Classification
【24h】

DNN-based Models for Speaker Age and Gender Classification

机译:基于DNN的演讲者年龄和性别分类模型

获取原文

摘要

Automatic speaker age and gender classification is an active research field due to the continuous and rapid development of applications related to humans' life and health. In this paper, we propose a new method for speaker age and gender classification, which utilizes deep neural networks (DNNs) as feature extractor and classifier. The proposed method creates a model for each speaker. For each test speech utterance, the similarity between the test model and the speaker class models are compared. Two feature sets have been used: Mel-frequency cepstral coefficients (MFCCs) and shifted delta cepstral (SDC) coefficients. The proposed model by using the SDC feature set achieved better classification results than that of MFCCs. The experimental results showed that the proposed SDC speaker model + SDC class model outperformed all the other systems by achieving 57.21% overall classification accuracy.
机译:自动演讲者年龄和性别分类是一个积极的研究领域,因为与人类生活和健康有关的申请的连续和快速发展。在本文中,我们提出了一种新的发言者年龄和性别分类方法,它利用深神经网络(DNN)作为特征提取器和分类器。该方法为每个扬声器创建模型。对于每个测试语音话语,比较测试模型和扬声器类模型之间的相似性。已经使用了两个特征集:熔融频率谱系数(MFCC)和移位的Delta谱(SDC)系数。通过使用SDC功能集的建议模型实现了比MFCC的更好的分类结果。实验结果表明,所提出的SDC扬声器型号+ SDC类模型通过实现总体分类准确性57.21%,表现优于所有其他系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号