Venue: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Learning emotion-based acoustic features with deep belief networks

Abstract

The medium of music has evolved specifically for the expression of emotions, and it is natural for us to organize music in terms of its emotional associations. But while such organization is a natural process for humans, quantifying it empirically proves to be a very difficult task, and as such no dominant feature representation for music emotion recognition has yet emerged. Much of the difficulty in developing emotion-based features is the ambiguity of the ground-truth. Even using the smallest time window, opinions on the emotion are bound to vary and reflect some disagreement between listeners. In previous work, we have modeled human response labels to music in the arousal-valence (A-V) representation of affect as a time-varying, stochastic distribution. Current methods for automatic detection of emotion in music seek performance increases by combining several feature domains (e.g. loudness, timbre, harmony, rhythm). Such work has focused largely in dimensionality reduction for minor classification performance gains, but has provided little insight into the relationship between audio and emotional associations. In this new work we seek to employ regression-based deep belief networks to learn features directly from magnitude spectra. While the system is applied to the specific problem of music emotion recognition, it could be easily applied to any regression-based audio feature learning problem.
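The abstract describes learning features directly from magnitude spectra with a regression-based deep belief network and mapping them to arousal-valence (A-V) coordinates. The sketch below is a minimal, hypothetical illustration of that pipeline, not the authors' implementation: a single Bernoulli RBM layer trained with one-step contrastive divergence on synthetic "spectra", followed by a ridge-regularized linear head predicting 2-D A-V targets. All dimensions, data, and hyperparameters are illustrative.

```python
# Illustrative sketch (NOT the paper's code): RBM feature learning over
# magnitude spectra + a linear regression head for 2-D arousal-valence.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Bernoulli-Bernoulli RBM trained with one-step contrastive divergence (CD-1)."""
    def __init__(self, n_visible, n_hidden, lr=0.05):
        self.W = rng.normal(0.0, 0.01, (n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)   # visible biases
        self.b_h = np.zeros(n_hidden)    # hidden biases
        self.lr = lr

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b_v)

    def cd1_step(self, v0):
        # Positive phase: hidden probabilities given the data.
        h0 = self.hidden_probs(v0)
        h0_sample = (rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one reconstruction step.
        v1 = self.visible_probs(h0_sample)
        h1 = self.hidden_probs(v1)
        n = v0.shape[0]
        self.W += self.lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b_v += self.lr * (v0 - v1).mean(axis=0)
        self.b_h += self.lr * (h0 - h1).mean(axis=0)

# Synthetic stand-ins: "magnitude spectra" normalized to [0, 1],
# and A-V labels in [-1, 1]^2 (arousal, valence).
X = rng.random((200, 64))
Y = rng.uniform(-1.0, 1.0, (200, 2))

rbm = RBM(n_visible=64, n_hidden=16)
for epoch in range(20):
    rbm.cd1_step(X)

# Learned features: hidden-unit activation probabilities.
H = rbm.hidden_probs(X)

# Ridge-regularized least-squares head mapping features to A-V coordinates.
H1 = np.hstack([H, np.ones((H.shape[0], 1))])       # append bias column
ridge = 1e-3 * np.eye(H1.shape[1])
coef = np.linalg.solve(H1.T @ H1 + ridge, H1.T @ Y)  # closed-form fit
pred = H1 @ coef
print(H.shape, pred.shape)
```

A full DBN would stack several such RBM layers (each trained on the hidden activations of the one below) before fitting the regression head; the single layer here only shows the shape of the approach.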
