首页> 外文会议>Symposium on Signal Processing, Images and Computer Vision >Automatic emotion recognition in compressed speech using acoustic and non-linear features
【24h】

Automatic emotion recognition in compressed speech using acoustic and non-linear features

机译:使用声学和非线性功能压缩讲话中的自动情感识别

获取原文

摘要

Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of the most relevant proposed applications of it are in call-centers. In these scenarios the speech is distorted by compression algorithms. The effects of such distortion on the performance of systems for automatic recognition of emotions must be assessed. In this study these effects are evaluated independently of any other distortions generated by the communications channel. Several state-of-the-art codecs are used to compress the speech signals of two emotional speech databases. The databases used are the Berlin Database of Emotional Speech and the enterface05. The methodology considers voiced and unvoiced segments of the speech separately. Spectral, cepstral, noise and Non-Linear Dynamics (NLD) measures are used to characterize the segments. Finally, a classifier based on a Gaussian Mixture Model (GMM) is used to identify the emotion. The results indicate that voiced segments are less affected by the compression than unvoiced ones in terms in classification accuracy. They also show that the bandwidth of the analyzed signals is an important factor in the classification results.
机译:近年来,自动识别言论中的情绪引起了研究界的注意。其中一些最相关的拟议应用程序在呼叫中心。在这些方案中,语音扭曲了压缩算法。必须评估这种失真对自动识别情绪的系统性能的影响。在该研究中,这些效果独立于通信信道产生的任何其他扭曲进行评估。若干最先进的编解码器用于压缩两个情绪语音数据库的语音信号。使用的数据库是情绪语音和Enterface05的柏林数据库。该方法认为演讲的浊音和清音段。光谱,倒谱,噪声和非线性动力学(NLD)测量用于表征段。最后,使用基于高斯混合模型(GMM)的分类器来识别情绪。结果表明,在分类准确性方面,浊音段的压缩程度较小。他们还表明分析信号的带宽是分类结果的重要因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号