首页> 外文会议>International symposium on methodologies for intelligent systems >Ranking Speech Features for Their Usage in Singing Emotion Classification
【24h】

Ranking Speech Features for Their Usage in Singing Emotion Classification

机译:在唱歌情感分类中的使用量排名言论

获取原文

摘要

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based on the Forest of Trees method, descriptors with the best ranking results were determined. Then, the emotions were classified using the Support Vector Machine (SVM). The training was performed several times, and the results were averaged. It was found that descriptors used for emotion detection in speech are not as useful for singing. Also, an approach using Convolutional Neural Network (CNN) employing spectrogram representation of audio signals was tested. Several parameters for singing were determined, which, according to the obtained results, allow for a significant reduction in the dimensionality of feature vectors while increasing the classification efficiency of emotion detection.
机译:本文旨在检索语音描述符,可能有助于唱歌中情绪的分类。为此目的,基于RACDES数据集计算MEL频率谱系齐系数(MFCC)和选择的低级MPEG 7描述符。该数据库包含情感言论的录音,唱歌呈现出六种不同情绪的专业演员。采用基于树木林的特征选择算法,确定了具有最佳排名结果的描述符。然后,使用支持向量机(SVM)进行分类的情绪。训练是多次进行的,并且结果平均。结果发现,用于言论中的情感检测的描述符不像唱歌那么有用。此外,测试了采用音频信号的频谱图表示的使用卷积神经网络(CNN)的方法。确定用于唱歌的几个参数,根据所得结果,允许在特征向量的维度的显着降低,同时增加情绪检测的分类效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号