ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score

Bharath K P; Rajesh Kumar M


Abstract

Speaker recognition under noisy conditions remains a major challenge in speech processing, because background noise significantly degrades system performance. The aim of this work is to identify speakers against both clean and noisy backgrounds using a limited dataset. We propose multitaper-based Mel-frequency cepstral coefficient (MFCC) and power-normalized cepstral coefficient (PNCC) features combined with fusion strategies. MFCC and PNCC features are extracted from the speech samples using different multitapers, and both feature sets are then normalized with cepstral mean and variance normalization (CMVN) and feature warping (FW). A low-dimensional i-vector model serves as the system model, and several score fusion strategies are applied: mean, maximum, weighted sum, cumulative, and concatenated fusion. Finally, an extreme learning machine (ELM) is used for classification to increase the system identification accuracy (SIA); an ELM is a single-hidden-layer feedforward neural network that is less complex and less time-consuming than other neural networks. The proposed system is evaluated on limited data from two databases, TIMIT and SITW 2016, and SIA is measured under both clean and noisy background conditions.
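
As a rough illustration of two steps named in the abstract, the sketch below applies CMVN to a small feature matrix and fuses per-speaker scores from two subsystems with the mean, maximum, and weighted-sum rules. It is not the authors' implementation: the function names `cmvn` and `fuse_scores`, the `weight` parameter, and the toy numbers are assumptions made for this example. Cumulative and concatenated fusion are left out because the abstract does not define them.

```python
from statistics import fmean, pstdev


def cmvn(frames, eps=1e-10):
    """Scale each feature dimension to zero mean and unit variance
    across all frames of one utterance (hypothetical helper)."""
    columns = list(zip(*frames))               # one tuple per feature dimension
    means = [fmean(col) for col in columns]
    stds = [pstdev(col) for col in columns]    # population standard deviation
    return [
        [(x - m) / (s + eps) for x, m, s in zip(frame, means, stds)]
        for frame in frames
    ]


def fuse_scores(scores_a, scores_b, rule="mean", weight=0.5):
    """Fuse per-speaker scores from two subsystems, e.g. an MFCC-based
    and a PNCC-based one (hypothetical helper; `weight` applies to scores_a)."""
    pairs = list(zip(scores_a, scores_b))
    if rule == "mean":
        return [(a + b) / 2 for a, b in pairs]
    if rule == "max":
        return [max(a, b) for a, b in pairs]
    if rule == "weighted_sum":
        return [weight * a + (1 - weight) * b for a, b in pairs]
    raise ValueError(f"unknown fusion rule: {rule}")


# Toy data: two frames with two cepstral coefficients each.
normalized = cmvn([[1.0, 10.0], [3.0, 14.0]])

# Toy scores for three enrolled speakers from each subsystem.
mfcc_scores = [0.2, 0.7, 0.1]
pncc_scores = [0.4, 0.5, 0.3]
fused = fuse_scores(mfcc_scores, pncc_scores, rule="weighted_sum", weight=0.75)
best_speaker = max(range(len(fused)), key=fused.__getitem__)
```

In this sketch the identified speaker is simply the index with the highest fused score; how the paper maps fused scores to a decision is not stated in the abstract.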

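
The abstract describes the ELM classifier as a single-hidden-layer feedforward network with low complexity. A minimal sketch of the textbook ELM formulation follows: hidden-layer weights are drawn at random and left fixed, and only the output weights are solved in closed form with a Moore-Penrose pseudo-inverse. The class name `ELMClassifier`, the `tanh` activation, the hidden-layer size, and the synthetic data are assumptions for illustration; the abstract does not give the paper's configuration.

```python
import numpy as np


class ELMClassifier:
    """Basic extreme learning machine for classification (illustrative sketch)."""

    def __init__(self, n_hidden=32, seed=0):
        self.n_hidden = n_hidden
        self.seed = seed

    def _hidden(self, X):
        # Random projection followed by a nonlinearity (tanh is an assumption).
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # One-hot target matrix, one column per class.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        rng = np.random.default_rng(self.seed)
        # Input weights and biases are random and never trained.
        self.W = rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = rng.normal(size=self.n_hidden)
        # Output weights: least-squares solution via the pseudo-inverse.
        self.beta = np.linalg.pinv(self._hidden(X)) @ T
        return self

    def predict(self, X):
        H = self._hidden(np.asarray(X, dtype=float))
        return self.classes_[np.argmax(H @ self.beta, axis=1)]


# Synthetic stand-in for utterance-level feature vectors of two speakers.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-3.0, 0.5, (40, 5)), rng.normal(3.0, 0.5, (40, 5))])
y = np.array([0] * 40 + [1] * 40)

model = ELMClassifier(n_hidden=32, seed=0).fit(X, y)
train_accuracy = (model.predict(X) == y).mean()
```

Because training reduces to one pseudo-inverse, there is no iterative backpropagation, which is the usual reason ELMs are described as fast to train.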

Bibliographic record

Source: Multimedia Tools and Applications, 2020, Issue 40, pp. 28859-28883 (25 pages)
Authors: Bharath K P; Rajesh Kumar M
Affiliation: School of Electronics Engineering, Vellore Institute of Technology, Vellore, India (both authors)
Original format: PDF
Language: English
Keywords: Multitaper; MFCC; PNCC; Frequency warping; CMVN
