Feature vector classification based speech emotion recognition for service robots

Jeong-Sik Park; Ji-Hwan Kim; Yung-Hwan Oh

Abstract

This paper proposes an efficient feature vector classification method for Speech Emotion Recognition (SER) in service robots. Because service robots interact with diverse users in various emotional states, two issues must be addressed: acoustically similar characteristics between emotions, and variable speaker characteristics arising from different user speaking styles. Either issue may cause substantial overlap between emotion models in feature vector space, which decreases SER accuracy. The conventional feature vector classification, applied to speaker identification, categorizes feature vectors as overlapped or non-overlapped. Because that method discards all overlapped vectors in model reconstruction, it is limited in constructing robust models when the number of overlapped vectors increases significantly, as in emotion recognition. The method proposed here classifies overlapped vectors more finely: it selects the discriminative vectors among the overlapped vectors and includes them in model reconstruction. In SER experiments using an emotional speech corpus, the proposed classification approach outperformed conventional methods and showed almost human-level performance. In particular, we achieved commercially applicable performance for two-class (negative vs. non-negative) emotion recognition.
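The abstract does not specify the emotion models or the rule for picking discriminative vectors, so the following Python sketch is only an illustration under stated assumptions. It uses a single diagonal Gaussian per emotion as a stand-in model, treats a training vector as non-overlapped when its own emotion model scores it highest, and marks an overlapped vector as discriminative when its own model loses to the best rival by at most a hypothetical `margin`. All function names and the `margin` parameter are invented for this example and are not taken from the paper.

```python
import numpy as np


def fit_diag_gaussian(x):
    """Fit one diagonal-covariance Gaussian; a stand-in for an emotion model."""
    return x.mean(axis=0), x.var(axis=0) + 1e-6  # small floor keeps variances positive


def log_likelihood(x, model):
    """Per-vector log-likelihood under a diagonal Gaussian."""
    mean, var = model
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var, axis=1)


def classify_vectors(features, labels, margin=1.0):
    """Return boolean masks (non_overlapped, discriminative) over the training vectors.

    `margin` is a hypothetical threshold chosen for illustration, not from the paper.
    """
    classes, idx = np.unique(labels, return_inverse=True)
    models = [fit_diag_gaussian(features[idx == k]) for k in range(len(classes))]
    scores = np.stack([log_likelihood(features, m) for m in models], axis=1)

    rows = np.arange(len(idx))
    own = scores[rows, idx]        # score under the vector's own emotion model
    rivals = scores.copy()
    rivals[rows, idx] = -np.inf
    rival = rivals.max(axis=1)     # best score under any other emotion model

    non_overlapped = own > rival
    # Assumed criterion: an overlapped vector counts as "discriminative" if its
    # own model loses to the best rival by no more than `margin`.
    discriminative = ~non_overlapped & (rival - own <= margin)
    return non_overlapped, discriminative


def reconstruct_models(features, labels, keep):
    """Re-estimate each emotion model from the retained vectors only."""
    return {c: fit_diag_gaussian(features[keep & (labels == c)])
            for c in np.unique(labels)}
```

In this sketch, the conventional scheme corresponds to `keep = non_overlapped`, while the scheme described in the abstract corresponds to `keep = non_overlapped | discriminative`. Both `features` and `labels` are assumed to be NumPy arrays, and every emotion is assumed to retain at least one vector.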

Bibliographic details

Source: Consumer Electronics, IEEE Transactions on | 2009, No. 3 | pp. 1590-1596 | 7 pages
Authors: Jeong-Sik Park; Ji-Hwan Kim; Yung-Hwan Oh
Affiliation: Computer Science Division, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea
Original format: PDF
Language: English
Keywords: Feature vector classification; Service robot; Speech emotion recognition

Similar literature

1. Deep features-based speech emotion recognition for smart affective services [J]. Badshah Abdul Malik, Rahim Nasir, Ullah Noor. Multimedia Tools and Applications. 2019, No. 5
2. Physical Features Based Speech Emotion Recognition Using Predictive Classification [J]. Mohammad Ahsan, Madhu Kumari. International Journal of Computer Science & Information Technology (IJCSIT). 2016, No. 2
3. Speaker Emotion Recognition based on Speech Features and Classification Techniques [J]. J. Sirisha Devi, Srinivas Yarramalle, Siva Prasad Nandyala. International Journal of Image, Graphics and Signal Processing. 2014, No. 7
4. An Affective Service based on Multi-Modal Emotion Recognition, using EEG enabled Emotion Tracking and Speech Emotion Recognition [C]. Danai Styliani Moschona. IEEE International Conference on Consumer Electronics - Asia. 2020
5. Design of loss functions and feature transformation for minimum classification error based automatic speech recognition [D]. Ratnagiri, Madhavi Vedula. 2011
6. Particle Swarm Optimization Based Feature Enhancement and Feature Selection for Improved Emotion Recognition in Speech and Glottal Signals [O]. Hariharan Muthusamy, Kemal Polat, Sazali Yaacob
7. Physical Features Based Speech Emotion Recognition Using Predictive Classification [O]. Mohammad Ahsan, Madhu Kumari. 2016
