The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech

机译：语音识别中讲话率参数的表现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The speaking rate is a quite obvious prosodic characteristic of speech and humans can easily estimate how fast an interlocutor is talking. Further, different emotional dispositions of a person are strongly expressed in his/her speaking rate. In this paper we investigate the performance gain originating from the use of the speaking rate parameter in emotion recognition from speech. The speaking rates are determined by applying a broad phonetic class recognizer. The classifier is trained on cepstral features extracted on the emotionally neutral RM1 speech corpus and provides low average recognition errors of one phoneme/second. We present the results of an empirical approach on the emotionally expressive Emo-DB corpus applying a neural network classifier and prove the significant influence of the speaking rate in emotion classification. The performances of Multi-Layer Perceptrons trained on cepstral turn-level features are analyzed with respect to the presence and absence of the speaking rate feature. An increase of accuracy up to 3.7% in certain emotion categories is reported.

机译：发言率是言语的一个相当明显的韵律特征，人类可以很容易地估计交流者的态度。此外，某人的不同情感处置是以他/她的发言率强烈表达的。在本文中，我们调查源自使用说话率参数在情感识别中的性能增益。通过应用广泛的语音类识别器确定说话率。分类器培训对在情绪中性RM1语音语料库上提取的倒谱特征，并提供一个音素/秒的低平均识别误差。我们在应用神经网络分类器的情感表达EMO-DB语料库上介绍了经验方法的结果，并证明了情绪分类中说话率的显着影响。关于讲速率特征的存在和不存在，分析了对临时转弯级别特征培训的多层感知的性能。报告了某些情感类别的准确性增加到3.7％。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2012年||共6页
会议地点
作者
Philippou-Hubner David; Vlasenko Bogdan; Bock Ronald; Wendemuth Andreas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP37-53;
关键词
入库时间 2022-08-21 13:11:14

相似文献

外文文献
中文文献
专利

1. Speaker independent feature selection for speech emotion recognition: A multi-task approach [J] . Kalhor Elham, Bakhtiari Behzad Multimedia Tools and Applications . 2021,第6期

机译：演讲者独立的语音情感识别特征选择：多任务方法
2. An efficient algorithm for recognition of emotions from speaker and language independent speech using deep learning [J] . Singh Youddha Beer, Goel Shivani Multimedia Tools and Applications . 2021,第9期

机译：一种高效算法，用于使用深度学习识别扬声器和语言独立演讲的情绪
3. Speaker Awareness for Speech Emotion Recognition [J] . Gustavo Assun??o, Paulo Menezes, Fernando Perdig?o International journal of online engineering . 2020,第04期

机译：演讲者言语情感认可的意识
4. The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech [C] . Philippou-Hubner David, Vlasenko Bogdan, Bock Ronald, Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops . 2012

机译：语音参数在语音识别中的性能
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Speaker-sensitive emotion recognition via ranking: Studies on acted and spontaneous speech☆ [O] . Houwei Cao, Ragini Verma, Ani Nenkova -1

机译：通过排名对说话者敏感的情感识别：对言语行为和自发言语的研究☆
7. Speaker Recognition and Speech Emotion Recognition Based on GMM [O] . Shupeng Xu, Yan Liu, Xiping Liu 2013

机译：基于GMM的扬声器识别和语音情感识别
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech

摘要

著录项

相似文献

相关主题

期刊订阅