Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

Song Peng

首页> 外文期刊>Affective Computing, IEEE Transactions on >Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

【24h】

Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

机译：转移线性子空间学习，用于跨语言语音情感识别

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Speech emotion recognition has received an increasing interest in recent years, which is often conducted on the assumption that speech utterances in training and testing datasets are obtained under the same conditions. However, in reality, this assumption does not hold as the speech data are often collected from different devices or environments. Hence, there exists discrepancy between the training and testing data, which will have an adverse effect on recognition performance. In this paper, we examine the problem of cross-corpus speech emotion recognition. To address it, we present a novel transfer linear subspace learning (TLSL) framework to learn a common feature subspace for source and target datasets. In TLSL, a nearest neighbor graph algorithm is used to measure the similarity between different corpora, and a feature grouping strategy is introduced to divide the emotional features into two categories, i.e., high transferable part (HTP) versus low transferable part (LTP). To explore the proposed TLSL with different scenarios, we propose two kinds of TLSL approaches, called transfer unsupervised linear subspace learning (TULSL) and transfer supervised linear subspace learning (TSLSL), and provide the corresponding solutions for the optimization problems. Extensive experiments on several benchmark datasets validate the effectiveness of TLSL for cross-corpus speech emotion recognition.

机译：近年来，言语情感认可已经获得了越来越兴趣的兴趣，这通常是在同样的条件下获得训练和测试数据集中的语音话语的假设。然而，实际上，这种假设不包含，因为语音数据通常从不同的设备或环境收集。因此，培训和测试数据之间存在差异，这将对识别性能产生不利影响。在本文中，我们研究了交叉语料库语音情感识别的问题。要解决它，我们介绍了一种新颖的传输线性子空间学习（TLSL）框架，用于学习源和目标数据集的通用特征子空间。在TLSL中，使用最近的邻居图算法来测量不同基层之间的相似性，并引入特征分组策略以将情绪特征划分为两类，即高可转移部分（HTP）与低可转换部分（LTP）。为了探索具有不同方案的提议的TLSL，我们提出了两种TLSL方法，称为传输无监督的线性子空间学习（TULSL）和转移监督线性子空间学习（TSLSL），并为优化问题提供相应的解决方案。关于多个基准数据集的广泛实验验证了TLSL对跨语料语音情绪识别的有效性。

著录项

来源
《Affective Computing, IEEE Transactions on》 |2019年第2期|265-275|共11页
作者
Song Peng;
展开▼
作者单位

Yantai Univ Sch Comp & Control Engn Yantai Shi 264005 Shandong Sheng Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Emotion recognition; transfer learning; subspace learning; dimensionality reduction;

机译：情绪识别;转移学习;子空间学习;减少维度;

相似文献

外文文献
中文文献
专利

1. Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Song Peng Affective Computing, IEEE Transactions on . 2019,第2期

机译：转移线性子空间学习用于跨企业语音情感识别
2. Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Weijian Zhang, Peng Song Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：转移稀疏判别子空间学习跨语料库语音情感识别
3. Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition [J] . Xiuzhen CHEN, Xiaoyan ZHOU, Cheng LU, IEICE transactions on information and systems . 2019,第12期

机译：跨团语音情感识别的目标自适应子空间学习
4. Unsupervised Cross-Corpus Speech Emotion Recognition Using Domain-Adaptive Subspace Learning [C] . Na Liu, Yuan Zong, Baofeng Zhang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：使用域 - 自适应子空间学习的无监督的交叉语音语音情感识别
5. Multilinear subspace learning for face and gait recognition [D] . Lu, Haiping 2008

机译：脸部和步态认可的多线性子空间学习
6. Fusing Visual Attention CNN and Bag of Visual Words for Cross-Corpus Speech Emotion Recognition [O] . Minji Seo, Myungho Kim 2020

机译：融合视觉关注CNN和跨语料语音情感识别的视觉词语
7. Indonesian Speech Emotion Recognition using Cross-Corpus Method with the Combination of MFCC and Teager Energy Features [O] . Oscar Utomo Kumala, Amalia Zahra 2021

机译：印度尼西亚语音情感识别使用交叉语料库方法具有MFCC和Teager能量特征的组合

Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅