...
首页> 外文期刊>Multimedia Tools and Applications >ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
【24h】

ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score

机译:使用基于Multiber的MFCC和PNCC功能的Lim Liment DataSet识别有限数据集,具有融合分数

获取原文
获取原文并翻译 | 示例

摘要

In current scenario, speaker recognition under noisy condition is the major challenging task in the area of speech processing. Due to noise environment there is a significant degradation in the system performance. The major aim of the proposed work is to identify the speaker's under clean and noise background using limited dataset. In this paper, we proposed a multitaper based Mel frequency cepstral coefficients (MFCC) and power normalization cepstral coefficients (PNCC) techniques with fusion strategies. Here, we used MFCC and PNCC techniques with different multitapers to extract the desired features from the obtained speech samples. Then, cepstral mean and variance normalization (CMVN) and Feature warping (FW) are the two techniques applied to normalize the obtained features from both the techniques. Furthermore, as a system model low dimension i-vector model is used and also different fusion score strategies like mean, maximum, weighted sum, cumulative and concatenated fusion techniques are utilized. Finally extreme learning machine (ELM) is used for classification in order to increase the system identification accuracy (SIA) intern which is having a single layer feedforward neural network with less complexity and time consuming compared to other neural networks. TIMET and SITW 2016 are the two different databases are used to evaluate the proposed system under limited data of these databases. Both clean and noisy backgrounds conditions are used to check the SIA.
机译:在目前的情景中,嘈杂情况下的扬声器识别是语音处理领域的主要具有挑战性的任务。由于噪声环境,系统性能存在显着的降级。拟议工作的主要目标是使用有限数据集识别清洁和噪声背景下的扬声器。在本文中,我们提出了一种基于多兆的MEL频率谱系数(MFCC)和具有融合策略的功率标准化谱系统(PNCC)技术。这里,我们使用MFCC和PNCC技术具有不同的多涂覆,以从所获得的语音样本中提取所需的特征。然后,临时临床均值和方差归一化(CMVN)和特征翘曲(FW)是应用于从这两个技术中获得所获得的特征的两种技术。此外,由于系统模型低维度I-向量模型,并且还利用了平均值,最大,加权和累积和连接的融合技术等不同的融合分数策略。最后,最终学习机(ELM)用于分类,以提高具有单层前馈神经网络的系统识别精度(SIA)实习,与其他神经网络相比具有较小复杂性和耗时的速度。 TIMET和SITW 2016是两种不同的数据库,用于根据这些数据库的有限数据评估所提出的系统。清洁和嘈杂的背景条件都用于检查SIA。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号