首页> 外文会议>Spoken Language Technology Workshop >Domain Generalization with Triplet Network for Cross-Corpus Speech Emotion Recognition

【24h】

Domain Generalization with Triplet Network for Cross-Corpus Speech Emotion Recognition

机译：具有三联网网络的域概括，用于跨语料库语音情感识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Domain generalization is a major challenge for cross-corpus speech emotion recognition. The recognition performance built on "seen" source corpora is inevitably degraded when the systems are tested against "unseen" target corpora that have different speakers, channels, and languages. We present a novel framework based on a triplet network to learn more generalized features of emotional speech that are invariant across multiple corpora. To reduce the intrinsic discrepancies between source and target corpora, an explicit feature transformation based on the triplet network is implemented as a preprocessing step. Extensive comparison experiments are carried out on three emotional speech corpora; two English corpora, and one Japanese corpus. Remarkable improvements of up-to 35.61% are achieved for all cross-corpus speech emotion recognition, and we show that the proposed framework using the triplet network is effective for obtaining more generalized features across multiple emotional speech corpora.

机译：域概括是交叉语料库语音情感认可的主要挑战。当系统经过有不同扬声器，渠道和语言的“看不见的”目标语料库测试时，建立在“看到”源语料库上建立的识别表现不可避免地降低。我们介绍了一个基于三联网网络的新颖框架，以了解多大语料中不变的情绪语音的更广泛的特征。为了降低源代码和目标语料库之间的内在差异，基于Triplet网络的显式功能转换被实现为预处理步骤。广泛的比较实验是在三个情绪语音上进行的;两个英文语料库和一个日本语料库。所有交叉语料库语音情感识别都会实现高达35.61％的显着改善，我们表明，使用Triplet网络的建议框架可有效地在多个情绪语音上获得更多的广义特征。

著录项

来源
《Spoken Language Technology Workshop》|2021年|389-396|共8页
会议地点
作者
Shi-wook Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Emotion recognition; Target recognition; Conferences; Neural networks; Speech recognition;

机译：情绪识别;目标识别;会议;神经网络;语音识别;
入库时间 2022-08-26 13:52:51

相似文献

外文文献
中文文献
专利

1. 用于跨库语音情感识别的时频原子听觉注意模型 [J] . 张昕然, 宋鹏, 查诚, 东南大学学报（英文版） . 2016,第004期
2. Cross-Corpus Speech Emotion Recognition Based on Deep Domain-Adaptive Convolutional Neural Network [J] . Jiateng LIU, Wenming ZHENG, Yuan ZONG, IEICE transactions on information and systems . 2020,第2期

机译：基于深域自适应卷积神经网络的交叉语料库语音情感识别
3. Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Weijian Zhang, Peng Song Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：转移稀疏判别子空间学习跨语料库语音情感识别
4. Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Xiuzhen CHEN, Xiaoyan ZHOU, Cheng LU, IEICE transactions on information and systems . 2019,第12期

机译：跨团语音情感识别的目标自适应子空间学习
5. Unsupervised Cross-Corpus Speech Emotion Recognition Using Domain-Adaptive Subspace Learning [C] . Na Liu, Yuan Zong, Baofeng Zhang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：使用域 - 自适应子空间学习的无监督的交叉语音语音情感识别
6. Domain Adaptation for Speech Based Emotion Recognition [D] . Abdelwahab, Mohammed. 2019

机译：基于语音情感识别的域适应
7. Fusing Visual Attention CNN and Bag of Visual Words for Cross-Corpus Speech Emotion Recognition [O] . Minji Seo, Myungho Kim 2020

机译：融合视觉关注CNN和跨语料语音情感识别的视觉词语
8. Cross-Corpus Speech Emotion Recognition Based on Deep Domain-Adaptive Convolutional Neural Network [O] . Jiateng LIU, Wenming ZHENG, Yuan ZONG, 2020

机译：基于深域自适应卷积神经网络的交叉语料库语音情感识别
9. Memristive Computational Architecture of an Echo State Network for Real-Time Speech Emotion Recognition. [R] . Saleh, Q., Merkel, C., Kudithipudi, D., 2015

机译：用于实时语音情感识别的回声状态网络的忆阻计算结构。

Domain Generalization with Triplet Network for Cross-Corpus Speech Emotion Recognition

摘要

著录项

相似文献

相关主题

期刊订阅