Ranking Speech Features for Their Usage in Singing Emotion Classification

Abstract

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based on the Forest of Trees method, descriptors with the best ranking results were determined. Then, the emotions were classified using the Support Vector Machine (SVM). The training was performed several times, and the results were averaged. It was found that descriptors used for emotion detection in speech are not as useful for singing. Also, an approach using Convolutional Neural Network (CNN) employing spectrogram representation of audio signals was tested. Several parameters for singing were determined, which, according to the obtained results, allow for a significant reduction in the dimensionality of feature vectors while increasing the classification efficiency of emotion detection.
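The pipeline the abstract describes (rank descriptors with a tree ensemble, keep the top-ranked ones, classify with an SVM, and average over repeated runs) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the data are synthetic stand-ins for MFCC and MPEG 7 descriptors, scikit-learn's `ExtraTreesClassifier` is assumed to be a reasonable counterpart of the "Forest of Trees" feature selection, and the number of retained features (`top_k`) and all hyperparameters are arbitrary choices.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for a descriptor matrix (assumption: 40 descriptors,
# 6 emotion classes). With shuffle=False the 5 informative columns come first.
X, y = make_classification(
    n_samples=600, n_features=40, n_informative=5, n_redundant=0,
    n_classes=6, n_clusters_per_class=1, class_sep=2.0,
    shuffle=False, random_state=0,
)

# Rank features by impurity-based importance from a forest of randomized trees.
forest = ExtraTreesClassifier(n_estimators=200, random_state=0)
forest.fit(X, y)
ranking = np.argsort(forest.feature_importances_)[::-1]  # best feature first

top_k = 5  # hypothetical cut-off; the abstract does not state one
top_idx = ranking[:top_k]


def mean_cv_accuracy(features):
    """Average SVM accuracy over 5 cross-validation folds."""
    model = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
    return cross_val_score(model, features, y, cv=5).mean()


full_acc = mean_cv_accuracy(X)
reduced_acc = mean_cv_accuracy(X[:, top_idx])

print("top-ranked feature indices:", top_idx.tolist())
print(f"accuracy, all 40 features: {full_acc:.3f}")
print(f"accuracy, top {top_k} features: {reduced_acc:.3f}")
```

For brevity the ranking here is computed on the full data set before cross-validation, which leaks information into the reduced-feature score; a careful evaluation would repeat the ranking inside each training fold.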

Bibliographic details

Source
International Symposium on Methodologies for Intelligent Systems, 2020, pp. 225-234 (10 pages)
Authors
Szymon Zaporowski; Bozena Kostek
Original format: PDF
Keywords
Mel Frequency Cepstral Coefficients (MFCC); MPEG 7 low-level audio descriptors; Feature selection; Singing expression classification;

Date indexed: 2022-08-26 13:53:55

