New transformed features generated by deep bottleneck extractor and a GMM–UBM classifier for speaker age and gender classification

Abstract

Speaker age and gender classification is one of the most challenging problems in speech signal processing. As technology has developed, identifying a speaker's age and gender has become a necessity for speaker verification and identification systems, with applications such as identifying suspects in criminal cases, improving human–machine interaction, and adapting music for people waiting in a queue. Despite intensive studies aimed at extracting descriptive and distinctive features, classification accuracies are still not satisfactory. In this work, a model for generating bottleneck features from a deep neural network and a Gaussian Mixture Model–Universal Background Model (GMM–UBM) classifier are proposed for the speaker age and gender classification problem. A deep neural network with a bottleneck layer is first trained in an unsupervised manner to calculate the initial weights between layers. It is then trained and tuned in a supervised manner to generate transformed mel-frequency cepstral coefficients (T-MFCCs). The GMM–UBM is used to build a GMM for each class, and these models are used to classify speaker age and gender. The age-annotated database of German telephone speech (aGender) is used to evaluate the proposed classification system. With the GMM–UBM classifier, the newly generated T-MFCCs show potential for significant improvements in speaker age and gender classification. The proposed classification system achieved an overall accuracy of 57.63%. The highest accuracy, 72.97%, was obtained for adult female speakers.
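The classification stage described in the abstract (one GMM per class derived from a shared UBM, then picking the class whose model scores the utterance highest) can be illustrated with a minimal sketch. Everything specific below is an assumption rather than the authors' configuration: the abstract does not state the number of mixture components, the adaptation scheme, or the feature dimensionality. The sketch uses means-only MAP adaptation with a relevance factor, which is one common GMM–UBM recipe, and hypothetical 2-D toy frames stand in for T-MFCCs. All function and variable names are illustrative.

```python
import math
import random

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v)))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

def component_log_probs(x, gmm):
    """log(w_k * N(x | mean_k, diag(var_k))) for every mixture component k."""
    out = []
    for w, mean, var in zip(gmm["weights"], gmm["means"], gmm["vars"]):
        ll = sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
                 for xi, m, v in zip(x, mean, var))
        out.append(math.log(w) + ll)
    return out

def avg_log_likelihood(frames, gmm):
    """Mean per-frame log-likelihood of an utterance under a GMM."""
    return sum(log_sum_exp(component_log_probs(x, gmm)) for x in frames) / len(frames)

def accumulate(frames, gmm):
    """Soft counts plus first- and second-order sums per component."""
    k, d = len(gmm["weights"]), len(frames[0])
    n = [0.0] * k
    s1 = [[0.0] * d for _ in range(k)]
    s2 = [[0.0] * d for _ in range(k)]
    for x in frames:
        lp = component_log_probs(x, gmm)
        norm = log_sum_exp(lp)
        for j in range(k):
            r = math.exp(lp[j] - norm)  # responsibility of component j for frame x
            n[j] += r
            for i in range(d):
                s1[j][i] += r * x[i]
                s2[j][i] += r * x[i] ** 2
    return n, s1, s2

def train_ubm(frames, k, iterations=10, var_floor=1e-3):
    """Fit a diagonal-covariance GMM to the pooled frames with plain EM."""
    d = len(frames[0])
    ordered = sorted(frames)  # crude init: spread the means over the sorted data
    means = [list(ordered[(2 * j + 1) * len(frames) // (2 * k)]) for j in range(k)]
    centre = [sum(x[i] for x in frames) / len(frames) for i in range(d)]
    spread = [max(sum((x[i] - centre[i]) ** 2 for x in frames) / len(frames), var_floor)
              for i in range(d)]
    gmm = {"weights": [1.0 / k] * k, "means": means, "vars": [spread[:] for _ in range(k)]}
    for _ in range(iterations):
        n, s1, s2 = accumulate(frames, gmm)
        n = [max(c, 1e-10) for c in n]  # guard against empty components
        means = [[s1[j][i] / n[j] for i in range(d)] for j in range(k)]
        gmm = {
            "weights": [c / sum(n) for c in n],
            "means": means,
            "vars": [[max(s2[j][i] / n[j] - means[j][i] ** 2, var_floor)
                      for i in range(d)] for j in range(k)],
        }
    return gmm

def map_adapt_means(frames, ubm, relevance=16.0):
    """Class GMM: UBM means pulled toward the class data; weights and variances kept."""
    n, s1, _ = accumulate(frames, ubm)
    means = [[(s1[j][i] + relevance * m) / (n[j] + relevance) for i, m in enumerate(mean)]
             for j, mean in enumerate(ubm["means"])]
    return {"weights": ubm["weights"], "means": means, "vars": ubm["vars"]}

def classify(frames, class_models):
    """Return the label whose adapted GMM gives the highest average log-likelihood."""
    return max(class_models, key=lambda label: avg_log_likelihood(frames, class_models[label]))

def toy_frames(rng, shift, count):
    """Hypothetical 2-D 'features': two clusters displaced by a class-specific shift."""
    return [[rng.gauss(c + shift, 1.0), rng.gauss(c + shift, 1.0)]
            for c in (0.0, 6.0) for _ in range(count)]

rng = random.Random(0)
train = {"class_a": toy_frames(rng, -1.0, 150), "class_b": toy_frames(rng, +1.0, 150)}
ubm = train_ubm([x for frames in train.values() for x in frames], k=2)
models = {label: map_adapt_means(frames, ubm) for label, frames in train.items()}
prediction = classify(toy_frames(rng, +1.0, 50), models)
print(prediction)
```

In this sketch the UBM score is not subtracted from the class scores, since doing so would shift every class score by the same amount and leave the decision unchanged. A real system would use many more components and real frame-level features in place of the toy data.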

Bibliographic details

Journal: Springer Open Choice
Authors: Arafat Abu Mallouh; Zakariya Qawaqneh; Buket D. Barkana
Volume (issue): 30(8)
Pages: 2581–2593
Total pages: 13
Original format: PDF
Chinese Library Classification: Surgery
Keywords: Speaker recognition; Age and gender; Classification; MFCCs; Deep neural network; DBF extractor

Similar documents

1. New transformed features generated by deep bottleneck extractor and a GMM-UBM classifier for speaker age and gender classification [J]. Abu Mallouh Arafat, Qawaqneh Zakariya, Barkana Buket D. Neural Computing & Applications. 2018, No. 8
2. Gammachirp Filter Banks Applied in Roust Speaker Recognition Based on GMM-UBM Classifier [J]. Deng Lei, Gao Yong. The International Arab Journal of Information Technology. 2020, No. 2
3. Deep neural network framework and transformed MFCCs for speaker's age and gender classification [J]. Qawaqneh Zakariya, Abu Mallouh Arafat, Barkana Buket D. Knowledge-Based Systems. 2017, No. JANa1
4. Comparison of LPCC and MFCC features and GMM and GMM-UBM modeling for limited data speaker verification [C]. Jayanthi Kumari T.R., Jayanna H.S. IEEE International Conference on Computational Intelligence and Computing Research. 2014
5. A Framework for Enhancing Speaker Age and Gender Classification by Using a New Feature Set and Deep Neural Network Architectures [D]. Abumallouh, Arafat. 2017
6. Medical Image Classification Based on Deep Features Extracted by Deep Model and Statistic Feature Fusion with Multilayer Perceptron [O]. ZhiFei Lai, HuiFang Deng. 2018
7. New transformed features generated by deep bottleneck extractor and a GMM–UBM classifier for speaker age and gender classification [O]. Arafat Abu Mallouh, Zakariya Qawaqneh, Buket D. Barkana. 2017
8. Classification of JERS-1 Image Mosaic of Central Africa Using A Supervised Multiscale Classifier of Texture Features [R]. Saatchi, Sassan; DeGrandi, Franco; Simard, Marc. 1999
