The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech

机译：语音参数在语音识别中的性能

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The speaking rate is a quite obvious prosodic characteristic of speech and humans can easily estimate how fast an interlocutor is talking. Further, different emotional dispositions of a person are strongly expressed in his/her speaking rate. In this paper we investigate the performance gain originating from the use of the speaking rate parameter in emotion recognition from speech. The speaking rates are determined by applying a broad phonetic class recognizer. The classifier is trained on cepstral features extracted on the emotionally neutral RM1 speech corpus and provides low average recognition errors of one phoneme/second. We present the results of an empirical approach on the emotionally expressive Emo-DB corpus applying a neural network classifier and prove the significant influence of the speaking rate in emotion classification. The performances of Multi-Layer Perceptrons trained on cepstral turn-level features are analyzed with respect to the presence and absence of the speaking rate feature. An increase of accuracy up to 3.7% in certain emotion categories is reported.

机译：语速是语音的一个很明显的韵律特征，人们可以轻松地估计对话者说话的速度。此外，一个人的不同情感倾向在他/她的语速中得到强烈表达。在本文中，我们研究了在语音情感识别中，由于使用了语速参数而导致的性能提升。语音速率是通过应用广泛的语音分类识别器来确定的。分类器接受了在情感中性的RM1语音语料库上提取的倒谱特征的训练，并提供了一个音素/秒的低平均识别误差。我们提出了使用神经网络分类器对情感表达Emo-DB语料库进行实证研究的结果，并证明了语速在情感分类中的重要影响。关于倒谱特征，对多层感知器的性能进行了倒谱特性的分析。据报道，在某些情绪类别中，准确性提高了3.7％。

著录项

来源
《Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops》|2012年|p.296- 301|共6页
会议地点 Melbourne(AU)
作者
Philippou-Hubner David; Vlasenko Bogdan; Bock Ronald; Wendemuth Andreas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类多媒体技术与多媒体计算机;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker independent feature selection for speech emotion recognition: A multi-task approach [J] . Kalhor Elham, Bakhtiari Behzad Multimedia Tools and Applications . 2021,第6期

机译：演讲者独立的语音情感识别特征选择：多任务方法
2. An efficient algorithm for recognition of emotions from speaker and language independent speech using deep learning [J] . Singh Youddha Beer, Goel Shivani Multimedia Tools and Applications . 2021,第9期

机译：一种高效算法，用于使用深度学习识别扬声器和语言独立演讲的情绪
3. Speaker Awareness for Speech Emotion Recognition [J] . Gustavo Assun??o, Paulo Menezes, Fernando Perdig?o International journal of online engineering . 2020,第04期

机译：演讲者言语情感认可的意识
4. The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech [C] . Philippou-Hubner David, Vlasenko Bogdan, Bock Ronald, 2012 IEEE International Conference on Multimedia and Expo . 2012

机译：语音参数在语音识别中的性能
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Speaker-sensitive emotion recognition via ranking: Studies on acted and spontaneous speech☆ [O] . Houwei Cao, Ragini Verma, Ani Nenkova -1

机译：通过排名对说话者敏感的情感识别：对言语行为和自发言语的研究☆
7. Speaker Recognition and Speech Emotion Recognition Based on GMM [O] . Shupeng Xu, Yan Liu, Xiping Liu 2013

机译：基于GMM的扬声器识别和语音情感识别
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech

摘要

著录项

相似文献

相关主题

期刊订阅