...
首页> 外文期刊>Knowledge-Based Systems >Automated accurate speech emotion recognition system using twine shuffle pattern and iterative neighborhood component analysis techniques
【24h】

Automated accurate speech emotion recognition system using twine shuffle pattern and iterative neighborhood component analysis techniques

机译:使用缠绕扫描模式和迭代邻域分量分析技术自动化准确的语音情感识别系统

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Speech emotion recognition is one of the challenging research issues in the knowledge-based system and various methods have been recommended to reach high classification capability. In order to achieve high classification performance in speech emotion recognition, a nonlinear multi-level feature generation model is presented by using cryptographic structure. The novelty of this work is the use of cryptographic structure called shuffle box for feature generation and iterative neighborhood component analysis to select the features. The proposed method has three main stages: (i) multi-level feature generation using Tunable Q wavelet transform (TQWT), (ii) twine shuffle pattern (twine-shufpat) for feature generation, and (iii) discriminative features are selected using iterative neighborhood component analysis (INCA) and classified. The TQWT is a multi-level wavelet transformation method used to generate high-level, medium-level, and low-level wavelet coefficients. The proposed twine-shuf-pat technique is used to extract the features from the decomposed wavelet coefficients. INCA feature selector is employed to select the clinically significant features. The performance of the obtained model is validated using four speech emotion public databases (RAVDESS Speech, Emo-DB (Berlin), SAVEE, and EMOVO). Our developed twine-shuf-pat and INCA based method yielded 87.43%, 90.09%, 84.79%, and 79.08% classification accuracies using RAVDESS, Emo-DB (Berlin), SAVEE and EMOVO corpora respectively with 10-fold cross-validation strategy. A mixed database is created from four public speech emotion databases which yielded 80.05% classification accuracy. Our obtained speech emotion model is ready to be tested with huge database and can be used in healthcare applications. (C) 2020 Elsevier B.V. All rights reserved.
机译:语音情感识别是基于知识的系统中的具有挑战性的研究问题之一,建议使用各种方法来达到高分类能力。为了在语音情感识别中实现高分类性能,使用加密结构呈现非线性多级特征生成模型。这项工作的新颖性是使用称为Shuffle框的加密结构,用于特征生成和迭代邻域分量分析,以选择功能。所提出的方法具有三个主要阶段:(i)使用可调调节Q小波变换(TQWT)的多级特征生成,(ii)特征生成的缠绕式混洗模式(Twe-shufpat),(iii)使用迭代选择歧视特征邻域分量分析(INCA)和分类。 TQWT是一种用于生成高电平,中级和低电平小波​​系数的多级小波变换方法。所提出的TweN-Shuf-PAT技术用于从分解的小波系数中提取特征。 INCA功能选择器用于选择临床显着的功能。使用四种语音情感公共数据库(Ravdess Speep,Emo-DB(柏林),Savee和Emovo)验证所获得的模型的性能。我们发育的麻线 - SHUF-PAT和INCA基于INCA的方法产生了87.43%,90.09%,84.79%,使用Ravdess,EMO-DB(柏林),Savee和Edovo Corpora分别具有10倍的交叉验证策略的79.08%。混合数据库是由四个公共语音情感数据库创建的,该数据库产生了80.05%的分类准确性。我们获得的语音情绪模型已准备好用巨大的数据库进行测试,可用于医疗保健应用程序。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号