Automatic emotion recognition in compressed speech using acoustic and non-linear features

机译：使用声学和非线性功能压缩讲话中的自动情感识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of the most relevant proposed applications of it are in call-centers. In these scenarios the speech is distorted by compression algorithms. The effects of such distortion on the performance of systems for automatic recognition of emotions must be assessed. In this study these effects are evaluated independently of any other distortions generated by the communications channel. Several state-of-the-art codecs are used to compress the speech signals of two emotional speech databases. The databases used are the Berlin Database of Emotional Speech and the enterface05. The methodology considers voiced and unvoiced segments of the speech separately. Spectral, cepstral, noise and Non-Linear Dynamics (NLD) measures are used to characterize the segments. Finally, a classifier based on a Gaussian Mixture Model (GMM) is used to identify the emotion. The results indicate that voiced segments are less affected by the compression than unvoiced ones in terms in classification accuracy. They also show that the bandwidth of the analyzed signals is an important factor in the classification results.

机译：近年来，自动识别言论中的情绪引起了研究界的注意。其中一些最相关的拟议应用程序在呼叫中心。在这些方案中，语音扭曲了压缩算法。必须评估这种失真对自动识别情绪的系统性能的影响。在该研究中，这些效果独立于通信信道产生的任何其他扭曲进行评估。若干最先进的编解码器用于压缩两个情绪语音数据库的语音信号。使用的数据库是情绪语音和Enterface05的柏林数据库。该方法认为演讲的浊音和清音段。光谱，倒谱，噪声和非线性动力学（NLD）测量用于表征段。最后，使用基于高斯混合模型（GMM）的分类器来识别情绪。结果表明，在分类准确性方面，浊音段的压缩程度较小。他们还表明分析信号的带宽是分类结果的重要因素。

著录项

来源
《Symposium on Signal Processing, Images and Computer Vision》|2015年||共7页
会议地点
作者
Garcia N.; Vasquez-Correa J.C.; Arias-Londono J.D.; Vargas-Bonilla J.F.; Orozco-Arroyave J.R.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
codecs; data compression; emotion recognition; speech coding; speech recognition; Berlin database of emotional speech; Gaussian mixture model; acoustic features; automatic emotion recognition; call-centers; codecs; compressed speech; compression algorithms; enterface05; non-linear dynamics; non-linear features; speech databases; speech signals; Accuracy; Codecs; Databases; Emotion recognition; Frequency measurement; Speech; Speech recognition;

机译：编解码器;数据压缩;情感识别;语音编码;语音识别;柏林情绪语音数据库;高斯混合模型;声学特征;自动情感识别;呼叫中心;编解码器;压缩算法;enterfect alceC;enterfact算法;非线性动态;非线性功能;语音数据库;语音信号;准确性;编解码器;数据库;情感识别;频率测量;语音识别;

相似文献

外文文献
中文文献
专利

1. Acoustic feature selection for automatic emotion recognition from speech [J] . Jia Rong, Gang Li, Yi-Ping Phoebe Chen Information Processing & Management . 2009,第3期

机译：语音特征选择可自动识别语音
2. Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features [J] . Ben Alex Starlet, Mary Leena, Babu Ben P. Circuits, systems and signal processing . 2020,第11期

机译：用话语和音节级韵律特征对自动语音情感识别的关注和特征选择
3. An optimal two stage feature selection for speech emotion recognition using acoustic features [J] . Swarna Kuchibhotla, Hima Deepthi Vankayalapati, Koteswara Rao Anne International journal of speech technology . 2016,第4期

机译：使用声学特征的语音情感识别的最佳两阶段特征选择
4. Automatic emotion recognition in compressed speech using acoustic and non-linear features [C] . Garcia N., Vasquez-Correa J.C., Arias-Londono J.D., 2015 20th Symposium on Signal Processing, Images and Computer Vision . 2015

机译：利用声学和非线性功能自动识别压缩语音中的情绪
5. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition . [D] . Wong, Brian. 2011

机译：压缩非线性表示语音频谱幅度提高语音自动识别的鲁棒性。
6. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [O] . Prasanta Kumar Ghosh, Shrikanth Narayanan -1

机译：使用从独立于受试者的声学到发音反转的发音特征进行自动语音识别
7. Automatic recognition of spontaneous emotions in speech using acoustic and lexical features [O] . Khiet P. Truong, Stephan Raaijmakers 2008

机译：使用声学和词汇特征自动识别语音中的自发情绪

Automatic emotion recognition in compressed speech using acoustic and non-linear features

摘要

著录项

相似文献

相关主题

期刊订阅