MULTIPLE WINDOWED SPECTRAL FEATURES FOR EMOTION RECOGNITION



Abstract

MFCC (mel-frequency cepstral coefficients) and PLP (perceptual linear prediction) coefficients, as well as RASTA-PLP, have demonstrated good results both when used in combination with prosodic features as suprasegmental (long-term) information and when used stand-alone as segmental (short-time) information. MFCC and PLP feature parameterizations aim to represent speech in a way similar to how humans perceive sound. However, MFCC and PLP are usually computed from a Hamming-windowed periodogram spectrum estimate, which has large variance. In this paper we study how averaging spectral estimates obtained with a set of orthogonal tapers (windows) affects emotion recognition performance. The multitaper MFCC and PLP features are examined separately as short-time information vectors modeled with Gaussian mixture models (GMMs). When tested on the FAU AIBO spontaneous emotion corpus, multiple windowed spectral features achieve a relative improvement of 2.2% to 3.9% over single windowed ones for both the MFCC and PLP systems.
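
The variance argument in the abstract can be illustrated with a small sketch. The abstract does not say which orthogonal tapers the authors use, so the sine taper family below is an assumption chosen because it has a closed form; the function names, frame length, and taper count are likewise hypothetical. The mel filterbank, cepstral, and GMM stages are omitted, and a naive DFT stands in for an FFT only to keep the example free of dependencies.

```python
import cmath
import math
import random
import statistics


def sine_tapers(n, k):
    """Return k orthonormal sine tapers of length n (assumed taper family)."""
    scale = math.sqrt(2.0 / (n + 1))
    return [
        [scale * math.sin(math.pi * j * (i + 1) / (n + 1)) for i in range(n)]
        for j in range(1, k + 1)
    ]


def power_spectrum(frame):
    """One-sided power spectrum |DFT|^2, computed with a naive O(n^2) DFT."""
    n = len(frame)
    out = []
    for f in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * f * i / n)
                  for i, x in enumerate(frame))
        out.append(abs(acc) ** 2)
    return out


def hamming_spectrum(frame):
    """Single-window estimate: Hamming-windowed periodogram, unit-energy window."""
    n = len(frame)
    w = [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]
    energy = sum(v * v for v in w)
    windowed = [x * v for x, v in zip(frame, w)]
    return [p / energy for p in power_spectrum(windowed)]


def multitaper_spectrum(frame, k=6):
    """Multitaper estimate: uniform average of k tapered periodograms."""
    acc = [0.0] * (len(frame) // 2 + 1)
    for taper in sine_tapers(len(frame), k):
        tapered = [x * t for x, t in zip(frame, taper)]
        for f, p in enumerate(power_spectrum(tapered)):
            acc[f] += p / k
    return acc


def compare_variance(trials=200, n=32, k=6, bin_index=8, seed=0):
    """Variance of each estimator at one frequency bin over white-noise frames."""
    rng = random.Random(seed)
    single, multi = [], []
    for _ in range(trials):
        frame = [rng.gauss(0.0, 1.0) for _ in range(n)]
        single.append(hamming_spectrum(frame)[bin_index])
        multi.append(multitaper_spectrum(frame, k)[bin_index])
    return statistics.pvariance(single), statistics.pvariance(multi)
```

Calling `compare_variance()` returns the variance of the Hamming-windowed and multitaper estimates at one frequency bin. On white noise the multitaper value should come out clearly smaller, roughly by a factor of the number of tapers, at the cost of coarser frequency resolution. Uniform averaging is the simplest weighting; other taper families and weighting schemes exist and may behave differently.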


Bibliographic details

Source: IEEE International Conference on Acoustics, Speech, and Signal Processing | 2013 | 5 pages
Authors: Yazid Attabi; Jahangir Alam; Pierre Dumouchel; Patrick Kenny; Douglas O'Shaughnessy
Original format: PDF
Chinese Library Classification: TN912-53

