Hierarchical classifier design for speech emotion recognition in the mixed-cultural environment

Vasuki P.; Aravindan Chandrabose

Abstract

Recognition of emotion in speech is a difficult task because of speaker factors such as gender, age, and cultural background (nationality, ethnicity, and region), as well as the acoustic environment. Among these factors, the cultural background of the speaker has a strong influence on the expression of emotion. The unsatisfactory performance of an emotion recognition engine built using mixed-cultural samples can be traced back to this influence. To address this issue, a two-level hierarchical engine has been designed to identify emotion from the speech of speakers with different cultural backgrounds. The first level of the hierarchical engine is a culture identification system, which identifies the corpus of an input utterance. As most of the speakers involved in the construction of a specific corpus are from the same locality and cultural background, we assume that a corpus represents the cultural background of its speakers. Based on the response of the first-level classifier, the input utterance is forwarded to the appropriate corpus-specific emotion recognition engine in the second level. Each corpus-specific emotion recognition system is a discriminative, multiclass SVM classifier trained with the emotional utterances of that particular corpus. The system has been tested with five different corpora collected from diverse cultural backgrounds, namely EMO-DB, SAVEE, IITKGP-SEC, Spanish corpus S0329, and CMU's Woogles corpus. The system achieved an accuracy of 82.01%, which is an improvement of 13.38% over monolithic approaches.
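The two-level routing described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the class names, the two-dimensional feature vectors, and the corpus and emotion labels are all hypothetical, and a simple nearest-centroid classifier stands in for the multiclass SVM so that the example runs with the Python standard library alone.

```python
from collections import defaultdict
from math import dist


class NearestCentroid:
    """Toy stand-in classifier (an assumption; the paper uses multiclass SVMs)."""

    def fit(self, X, y):
        groups = defaultdict(list)
        for features, label in zip(X, y):
            groups[label].append(features)
        # Centroid = per-dimension mean of each class's feature vectors.
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_one(self, features):
        # Return the label whose centroid is closest in Euclidean distance.
        return min(self.centroids, key=lambda lab: dist(features, self.centroids[lab]))


class HierarchicalEmotionRecognizer:
    """Level 1 identifies the corpus; level 2 applies that corpus's emotion model."""

    def fit(self, X, corpora, emotions):
        # Level 1: corpus identification, used as a proxy for cultural background.
        self.corpus_clf = NearestCentroid().fit(X, corpora)
        # Level 2: one emotion classifier per corpus, trained on that corpus only.
        self.emotion_clfs = {}
        for corpus in set(corpora):
            idx = [i for i, c in enumerate(corpora) if c == corpus]
            self.emotion_clfs[corpus] = NearestCentroid().fit(
                [X[i] for i in idx], [emotions[i] for i in idx]
            )
        return self

    def predict_one(self, features):
        corpus = self.corpus_clf.predict_one(features)
        return corpus, self.emotion_clfs[corpus].predict_one(features)


# Hypothetical toy data: the second feature signals opposite emotions
# in the two corpora, which is what a single monolithic model would blur.
X = [[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 1.0]]
corpora = ["corpus_a", "corpus_a", "corpus_b", "corpus_b"]
emotions = ["neutral", "anger", "anger", "neutral"]

model = HierarchicalEmotionRecognizer().fit(X, corpora, emotions)
print(model.predict_one([0.2, 0.9]))  # ('corpus_a', 'anger')
print(model.predict_one([9.8, 0.9]))  # ('corpus_b', 'neutral')
```

In this toy setup the same value of the second feature maps to different emotions depending on the corpus, so routing through the first-level decision is what lets each second-level model stay simple. An error at the first level propagates to the second, which is a general property of hierarchical designs of this kind.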

Bibliographic details

Source
Journal of Experimental & Theoretical Artificial Intelligence, 2021, issue 3, pp. 451-466 (16 pages)
Authors
Vasuki P.; Aravindan Chandrabose
Author affiliations

Department of IT, SSN College of Engineering, Chennai, Tamil Nadu, India;

Department of CSE, SSN College of Engineering, Chennai, Tamil Nadu, India;

Original format: PDF
Language of text: English
Keywords
Speech emotion classification; hierarchical classification system; integrated corpus environment
Added to database: 2022-08-19 02:04:09

Similar literature

1. Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier [J]. Daneshfar Fatemeh, Kabudian Seyed Jahanshah, Neekabadi Abbas. Applied Acoustics. 2020, issue Sepa
2. Bag-of-words from image to speech: a multi-classifier emotions recognition system [J]. Mai Ezz-Eldin, Ali Ismail Awad, Hesham F. A. Hamed. International Journal of Engineering & Technology. 2020, issue 3
3. A novel speech emotion recognition algorithm based on wavelet kernel sparse classifier in stacked deep auto-encoder model [J]. Wei Pengcheng, Zhao Yu. Personal and Ubiquitous Computing. 2019, issue 3a4
4. Speech based Emotion Recognition based on hierarchical decision tree with SVM, BLG and SVR classifiers [C]. Garg Vipul, Kumar Harsh, Sinha Rohit. National Conference on Communications. 2013
5. Hierarchical learning of discriminative features and classifiers for large-scale visual recognition [D]. Zhou, Ning. 2014
6. Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech [O]. Aitor Álvarez, Basilio Sierra, Andoni Arruti. 2016
7. On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers [O]. Bagus Tris Atmaja, Masato Akagi. 2020
8. Design and Requirements Evolution of a Speech Recognition Technology for Tactical Applications and Environments [R]. Reed, L. 2004
