Scale-invariant MFCCs for speech/speaker recognition

Zekeriya TüFEKC?; G?kay D??KEN

首页> 外文期刊>Turkish Journal of Electrical Engineering and Computer Sciences >Scale-invariant MFCCs for speech/speaker recognition

【24h】

Scale-invariant MFCCs for speech/speaker recognition

机译：用于语音/扬声器识别的Scale-Invariant MFCC

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The feature extraction process is a fundamental part of speech processing. Mel frequency cepstral coefficients (MFCCs) are the most commonly used feature types in the speech/speaker recognition literature. However, the MFCC framework may face numerical issues or dynamic range problems, which decreases their performance. A practical solution to these problems is adding a constant to filter-bank magnitudes before log compression, thus violating the scale-invariant property. In this work, a magnitude normalization and a multiplication constant are introduced to make the MFCCs scale-invariant and to avoid dynamic range expansion of nonspeech frames. Speaker verification experiments are conducted to show the effectiveness of the proposed scheme.

机译：特征提取过程是语音处理的基本部分。 MEL频率患者系数（MFCC）是语音/扬声器识别文献中最常用的特征类型。但是，MFCC框架可能面临数值问题或动态范围问题，这降低了它们的性能。对这些问题的实际解决方案在日志压缩之前将常数添加到滤波器库幅度，从而违反了规模不变的属性。在这项工作中，引入了幅度归一化和乘法常数，以使MFCCS鳞片不变，并避免NonsPeech帧的动态范围扩展。扬声器验证实验进行了展示拟议计划的有效性。

著录项

来源
《Turkish Journal of Electrical Engineering and Computer Sciences》 |2019年第5期|共5页
作者
Zekeriya TüFEKC?; G?kay D??KEN;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Feature extractionspeaker recognitionspeech recognition;

机译：功能提取销钉识别行程识别;

相似文献

外文文献
中文文献
专利

1. Automatic Speaker Recognition Dependency on Both the Shape of Auditory Critical Bands and Speaker Discriminative MFCCs [J] . JOKIC I, DELIC V, JOKIC S, Advances in Electrical and Computer Engineering . 2015,第4期

机译：听觉关键带的形状和说话人区分性MFCC的自动说话人识别依赖性
2. Speaker Recognition for Hindi Speech Signal using MFCC-GMM Approach [J] . Ankur Maurya, Divya Kumar, R.K. Agarwal Procedia Computer Science . 2018,第5期

机译：使用MFCC-GMM方法的印地语语音信号扬声器识别
3. Design of an Automatic Speaker Recognition System Based on Adapted MFCC and GMM Methods for Arabic Speech [J] . El Bachir TAZI, Abderrahim BENABBOU, Mostafa. HARTI International journal of computer science and network security . 2010,第1期

机译：基于自适应MFCC和GMM方法的阿拉伯语音自动说话人识别系统设计
4. Speaker Independent Automatic Emotion Recognition from Speech: A Comparison of MFCCs and Discrete Wavelet Transforms [C] . Firoz Shah A., Vimal Krishnan V. R., Raji Sukumar A., International Conference on Advances in Recent Technologies in Communication and Computing . 2009

机译：扬声器独立自动情感识别来自语音：MFCC和离散小波变换的比较
5. A speech recognition IC with an efficient MFCC extraction algorithm and multi-mixture models. [D] . Han, Wei. 2006

机译：具有高效MFCC提取算法和多混合模型的语音识别IC。
6. One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions [O] . Xianglilan Zhang, Jiping Sun, Zhigang Luo 2010

机译：不利条件下与语言无关和与说话者相关的语音识别的一对多加权动态时间规整
7. Automatic Speaker Recognition Dependency on Both the Shape of Auditory Critical Bands and Speaker Discriminative MFCCs [O] . JOKIC, I., DELIC, V., JOKIC, S., 2015

机译：自动说话人识别依赖于听觉临界频带和扬声器判别mFCC的形状

Scale-invariant MFCCs for speech/speaker recognition

摘要

著录项

相似文献

相关主题

期刊订阅