Effect of Feature Dimension on Classification of Speech Emotions

Abstract

This paper analyses both the static and temporal dynamics of the spectral features in classifying speech emotions. Initially, different frame-level spectral techniques such as the Linear Prediction Cepstral Coefficients (LPCC), Perceptual LP coefficients (PLP), and Mel-Frequency Cepstral Coefficients (MFCC) have been examined. Further, these spectral features are extracted using Wavelet Analysis (WA) for a better emotional portrayal. The extracted feature sets remain high-dimensional and overload the recognizer with redundant features, large memory space, and slower response. To alleviate these issues and fetch more discriminating parameters, the applicability of Vector Quantization in clustering the data has been explored. Machine learning algorithms such as the Gaussian Mixture Model (GMM), the Probabilistic Neural Network (PNN), and the Multilayer Perceptron (MLP) have been simulated with the derived feature sets for their effectiveness in classifying speech emotions. While the GMM has been efficient in classifying the frame-level feature dimension, the NN-based classifiers outperform the GMM for low feature dimensions as revealed from our results.
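
As a rough illustration of the vector quantization step mentioned in the abstract, the sketch below clusters frame-level feature vectors into a small codebook using plain k-means (Lloyd's algorithm), so that an utterance with hundreds of frames is summarized by a few codewords. This is only an assumption about how such a reduction might look, not the authors' implementation: the function names `build_codebook` and `assign`, the codebook size of 8, the 13-dimensional frames, and the synthetic data are all hypothetical.

```python
import numpy as np


def assign(frames, codebook):
    """Return, for each frame, the index of its nearest codeword."""
    # Squared Euclidean distance between every frame and every codeword.
    dists = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def build_codebook(frames, n_codewords=8, n_iter=20, seed=0):
    """Cluster frame-level features into a VQ codebook (Lloyd's k-means).

    frames: array of shape (n_frames, n_dims), e.g. one row per speech frame.
    Returns (codebook, labels), where codebook has shape
    (n_codewords, n_dims) and labels has shape (n_frames,).
    """
    frames = np.asarray(frames, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialise the codewords from randomly chosen distinct frames.
    start = rng.choice(len(frames), size=n_codewords, replace=False)
    codebook = frames[start].copy()
    for _ in range(n_iter):
        labels = assign(frames, codebook)
        for k in range(n_codewords):
            members = frames[labels == k]
            if len(members) > 0:  # keep the old codeword if a cluster is empty
                codebook[k] = members.mean(axis=0)
    return codebook, assign(frames, codebook)


# Synthetic stand-in for one utterance: 300 frames of 13 coefficients each
# (hypothetical sizes; real frames would come from a feature extractor).
rng = np.random.default_rng(42)
frames = rng.normal(size=(300, 13))

codebook, labels = build_codebook(frames)
utterance_vector = codebook.ravel()  # fixed-length input for a classifier
print(frames.shape, "->", codebook.shape, "->", utterance_vector.shape)
```

Flattening the codebook gives every utterance the same low-dimensional representation regardless of its duration, which is the kind of input a PNN or MLP expects; a GMM, by contrast, can be fitted directly on the frame-level vectors.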

Bibliographic details

Source
Odisha International Conference on Electrical Power Engineering, Communication and Computing Technology | 2021 | pp. 1-5 | 5 pages
Authors
Hemanta Kumar Palo; Niharika Pattanaik; Bibhu Prasad Mohanty; Laxmi Prasad Mishra;
Original format: PDF
Keywords
Power engineering; Vector quantization; Artificial neural networks; Multilayer perceptrons; Feature extraction; Wavelet analysis; Probabilistic logic;

Similar literature

1. Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier [J]. Daneshfar Fatemeh, Kabudian Seyed Jahanshah, Neekabadi Abbas. Applied Acoustics. 2020, Issue Sepa
2. Acoustic feature selection and classification of emotions in speech using a 3D continuous emotion model [J]. Humberto Perez-Espinosa, Carlos A. Reyes-Garcia, Luis Villasenor-Pineda. Biomedical Signal Processing and Control. 2012, Issue 1
3. Multi-stage classification of emotional speech motivated by a dimensional emotion model [J]. Zhongzhe Xiao, Emmanuel Dellandrea, Weibei Dou. Multimedia Tools and Applications. 2010, Issue 1
4. Emotions in speech - experiments with prosody and quality features in speech for use in categorical and dimensional emotion recognition environments [C]. Borchert, M., Dusterhoft, . 2005
5. Nonlinear discriminant analysis based feature dimensionality reduction for automatic speech recognition [D]. Hu, Hongbing. 2010
6. On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition [O]. Juraj Kacur, Boris Puterka, Jarmila Pavlovicova. 2021
7. Speech emotion classification with the combination of statistic features and temporal features [O]. Dan-ning Jiang, Lian-hong Cai. 2004
