Intermediary Fuzzification in Speech Emotion Recognition

机译：语音情感识别中的中间模糊化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Affective systems are getting increasingly more attention from researchers and high-tech companies in order to enable the acknowledgment or adaptation to a user’s mood. Emotion classification is typically a hard problem due to the number of subtle cues which are present in human facial and body expressions, or in voiced utterances. Another critical factor is that typically used models tend to map emotions into all-or-nothing regions with artificially sharp divisions among them, a view which is rather unsupported in the field of psychology and human behavioral analysis. In this paper we propose the inclusion of an intermediary fuzzy layer in a VGGVox-based NN, whose aim is to deal with the inherently foggy transitions between emotional states. This neuro-fuzzy model was trained and evaluated against four emotional speech databases and has shown improvements in the classification performance over a non-fuzzy counterpart. Observed performances were also on-par or above those of other current state-of-the-art techniques.

机译：情感系统越来越受到研究人员和高科技公司的关注，以便能够确认或适应用户的情绪。由于存在于人的面部和身体表情或发声中的微妙线索的数量，情绪分类通常是一个难题。另一个关键因素是，通常使用的模型倾向于将情感映射到全有或全无的区域，其中人为地将其划分得很清晰，这一观点在心理学和人类行为分析领域中是相当缺乏支持的。在本文中，我们建议在基于VGGVox的NN中包括一个中间模糊层，其目的是处理情绪状态之间固有的模糊过渡。该神经模糊模型是针对四个情感语音数据库进行训练和评估的，与非模糊模型相比，该模型在分类性能上已有改进。观察到的性能也达到或超过了其他当前最新技术水平。

著录项

来源
《》|2020年|1-6|共6页
会议地点
作者
Gustavo Assunção; Paulo Menezes;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Emotion recognition; Feature extraction; Speech recognition; Neural networks; Fuzzy logic; Machine learning; Spectrogram;

机译：情感识别;特征提取;语音识别;神经网络;模糊逻辑;机器学习;声谱图;

相似文献

外文文献
中文文献
专利

1. Assessment of spontaneous emotional speech database toward emotion recognition: Intensity and similarity of perceived emotion from spontaneously expressed emotional speech [J] . Arimoto Y., Ohno S., Iida H. Acoustical science and technology . 2011,第1期

机译：评估针对情绪识别的自发情绪语音数据库：自发表达情绪语音时感知到的情绪的强度和相似性
2. Assessment of spontaneous emotional speech database toward emotion recognition: Intensity and similarity of perceived emotion from spontaneously expressed emotional speech [J] . Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida Acoustical science and technology . 2011,第1期

机译：评估自发情绪语音数据库对情绪识别的作用：自发表达情绪语音时感知到的情绪的强度和相似性
3. Temporal Pattern Classification using Kernel Methods for Speech Recognition and Speech Emotion Recognition [J] . C. Chandra Sekhar, S. Chandrakala Defence Science Journal . 2010,第4期

机译：语音识别和语音情感识别的内核方法时空模式分类
4. Emotion Controllable Speech Synthesis Using Emotion-Unlabeled Dataset with the Assistance of Cross-Domain Speech Emotion Recognition [C] . Xiong Cai, Dongyang Dai, Zhiyong Wu, IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：在跨域语音情感认可的帮助下，使用情感 - 未标记的数据集进行情感可控语音合成
5. Domain Adaptation for Speech Based Emotion Recognition [D] . Abdelwahab, Mohammed. 2019

机译：基于语音情感识别的域适应
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O] . Santiago-Omar Caballero-Morales 2013

机译：墨西哥西班牙语语音中的情绪识别：一种基于情绪特定元音声学模型的方法
7. Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition [O] . Xinzhou Xu, Jun Deng, Nicholas Cummins, 2019

机译：言语中的自主情感学习：零射击语音情感识别的视图

Intermediary Fuzzification in Speech Emotion Recognition

摘要

著录项

相似文献

相关主题

期刊订阅