Speaker age interval and sex identification based on Jitters, Shimmers and Mean MFCC using supervised and unsupervised discriminative classification methods

机译：基于抖动，微光和均值MFCC的说话者年龄间隔和性别识别，采用有监督和无监督的判别分类方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Discrimination ability of Speech long term features, including Jitters, Shimmers and Mean MFCC is proposed, for age interval and sex identification. First to make a primary study of discrimination ability, two well-known unsupervised classification methods, i.e. K-Means and FCM, were used. Then, two supervised discriminative classification approaches, namely MLP neural network and SVM, have been employed for more precise age interval and sex identification, tn addition, in order to make a study of mutual influences of age interval and sex discriminative features, a cascade combination of two MLPs neural networks, with one trained for age interval and other one for sex identification, has been utilized separately. Most practical applications of age interval and sex identification are remote applications where usually speech signal is affected by telecommunication channels. To take this affect into consideration, a telephonic database has been used in experiments. Obtained results demonstrate that Jitter and Shimmer have good discrimination ability between male and female or young and old speakers, but do not discriminate small age intervals appropriately. On the other hand, Mean MFCC is not suitable for sex unsupervised classification but leads to an increase in sex supervised classification performance. Also these coefficients contain useful information about speaker age interval, and can result in a decrease in identification error rate.

机译：提出了语音长期特征的识别能力，包括抖动，微光和均值MFCC，用于年龄间隔和性别识别。首先对歧视能力进行初步研究，使用了两种众所周知的无监督分类方法，即K-Means和FCM。然后，采用了两种有监督的判别分类方法，即MLP神经网络和SVM，用于更精确的年龄区间和性别识别，此外，为了研究年龄区间和性别判别特征的相互影响，采用级联组合分别使用了两个MLP神经网络中的一个，其中一个训练了年龄间隔，另一个训练了性别识别。年龄间隔和性别识别的大多数实际应用是远程应用，通常语音信号会受到电信信道的影响。为了考虑这种影响，在实验中使用了电话数据库。所得结果表明，抖动和闪光对男性和女性或年轻和年长的说话者具有良好的辨别能力，但不能适当地区分较小的年龄段。另一方面，Mean MFCC不适用于性别无监督分类，但会导致性别有监督分类性能提高。这些系数还包含有关说话者年龄间隔的有用信息，并且可能导致识别错误率降低。

著录项

来源
《International Conference on Signal Processing(ICSP'06); 20061116-20; Guilin(CN)》|2006年|P.684-687|共4页
会议地点 Guilin(CN)
作者
A. Sadeghi Naini; M. M. Homayounpour;
展开▼
作者单位

Laboratory for Intelligent Sound and Speech Processing, Computer Engineering and IT Department, Amirkabir University of technology (Tehran Polytechnics), 424 Hafez Avemue, Tehran, Iran;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类通信理论;
关键词
age sex identification; jitter; shimmer; mean MFCC; MLP; SVM; K-Means; FCM; cascade combination;

机译：年龄和性别识别；抖动；闪光；平均MFCC； MLP； SVM； K均值； FCM；级联;

相似文献

外文文献
中文文献
专利

1. Deep neural network framework and transformed MFCCs for speaker's age and gender classification [J] . Qawaqneh Zakariya, Abu Mallouh Arafat, Barkana Buket D. Knowledge-Based Systems . 2017,第JANa1期

机译：深度神经网络框架和转换后的MFCC用于说话人的年龄和性别分类
2. Rmixmod: The R Package of the Model-Based Unsupervised, Supervised, and Semi-Supervised Classification Mixmod Library [J] . Rémi Lebret, Serge Iovleff, Florent Langrognet, Journal of Statistical Software . 2015,第1期

机译：Rmixmod：基于模型的无监督，受监督和半监督分类Mixmod库的R包
3. Discriminative likelihood score weighting based on acoustic-phonetic classification for speaker identification [J] . Youngjoo Suh, Hoirin Kim EURASIP journal on advances in signal processing . 2014,第1期

机译：基于语音分类的判别似然评分加权用于说话人识别
4. Speech analysis in search of speakers with MFCC, PLP, Jitter and Shimmer [C] . Imen Daly, Zied Hajaiej, Ali Gharsallah 2017 International Conference on Advanced Systems and Electric Technologies . 2017

机译：语音分析，用于使用MFCC，PLP，抖动和微光搜索说话者
5. Classifying Land Use/Land Cover Change Over Time within the Watershed Boundary of Keenjhar Lake Using Supervised, Unsupervised, and Hybrid Classification Methods [D] . Henry, Katherine Rae. 2021

机译：分类土地使用/陆地覆盖随着时间的推移在Keenjhar Lake的流域边界内使用监督，无监督和混合分类方法而变化
6. Comparison of Supervised and Unsupervised Deep Learning Methods for Medical Image Synthesis between Computed Tomography and Magnetic Resonance Images [O] . Yafen Li, Wen Li, Jing Xiong, 2020

机译：计算机断层扫描与磁共振图像中医学图像综合的监督和无监督深度学习方法的比较
7. Development of supervised and unsupervised pixel based classification methods for medical image segmentation [O] . Σπυρίδων Κωστόπουλος -1

机译：基于监督和无监督像素的医学图像分割分类方法的开发

Speaker age interval and sex identification based on Jitters, Shimmers and Mean MFCC using supervised and unsupervised discriminative classification methods

摘要

著录项

相似文献

相关主题

期刊订阅