首页> 外文期刊>Pattern recognition letters >Speech emotion recognition via learning analogies
【24h】

Speech emotion recognition via learning analogies

机译:通过学习类比语音情感识别

获取原文
获取原文并翻译 | 示例
       

摘要

This work introduces the few-shot learning paradigm in the speech emotion recognition domain. Emotional characterization of speech segments is carried out through analogies, i.e. by assessing similarities and dissimilarities between novel and known recordings. More specifically, we designed a Siamese Neural Network modeling such relationships on the combined log-Mel and temporal modulation spectrogram space. We present thorough experimentations assessing the performance of the proposed solution holistically, where it is demonstrated that it reaches state of the art rates when following the standard leave one-speaker-out protocol, while at the same time being able to operate in non-stationary conditions, i.e. with limited knowledge of speakers and/or emotional classes. Finally, we investigated the activation maps in a layer-wise manner in order to interpret the predictions made by the model.(c) 2021 Elsevier B.V. All rights reserved.
机译:这项工作介绍了语音情感识别域中的几次射击学习范例。语音段的情感表征是通过类比进行的,即通过评估新颖和已知记录之间的相似性和异化。更具体地,我们设计了暹罗神经网络在组合的日志MEL和时间调制谱图空间上进行这种关系。我们呈现彻底的实验,在全面评估所提出的解决方案的性能,在那里证明它在遵循标准留出一扬声器输出协议时达到最先进的速率,同时能够以非静止方式运行条件,即扬声器和/或情绪课程的知识有限。最后,我们以层式方式调查了激活图,以解释模型所做的预测。(c)2021 Elsevier B.v.保留所有权利。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号