首页> 外文期刊>Affective Computing, IEEE Transactions on >Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition
【24h】

Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition

机译:转移线性子空间学习,用于跨语言语音情感识别

获取原文
获取原文并翻译 | 示例
       

摘要

Speech emotion recognition has received an increasing interest in recent years, which is often conducted on the assumption that speech utterances in training and testing datasets are obtained under the same conditions. However, in reality, this assumption does not hold as the speech data are often collected from different devices or environments. Hence, there exists discrepancy between the training and testing data, which will have an adverse effect on recognition performance. In this paper, we examine the problem of cross-corpus speech emotion recognition. To address it, we present a novel transfer linear subspace learning (TLSL) framework to learn a common feature subspace for source and target datasets. In TLSL, a nearest neighbor graph algorithm is used to measure the similarity between different corpora, and a feature grouping strategy is introduced to divide the emotional features into two categories, i.e., high transferable part (HTP) versus low transferable part (LTP). To explore the proposed TLSL with different scenarios, we propose two kinds of TLSL approaches, called transfer unsupervised linear subspace learning (TULSL) and transfer supervised linear subspace learning (TSLSL), and provide the corresponding solutions for the optimization problems. Extensive experiments on several benchmark datasets validate the effectiveness of TLSL for cross-corpus speech emotion recognition.
机译:近年来,言语情感认可已经获得了越来越兴趣的兴趣,这通常是在同样的条件下获得训练和测试数据集中的语音话语的假设。然而,实际上,这种假设不包含,因为语音数据通常从不同的设备或环境收集。因此,培训和测试数据之间存在差异,这将对识别性能产生不利影响。在本文中,我们研究了交叉语料库语音情感识别的问题。要解决它,我们介绍了一种新颖的传输线性子空间学习(TLSL)框架,用于学习源和目标数据集的通用特征子空间。在TLSL中,使用最近的邻居图算法来测量不同基层之间的相似性,并引入特征分组策略以将情绪特征划分为两类,即高可转移部分(HTP)与低可转换部分(LTP)。为了探索具有不同方案的提议的TLSL,我们提出了两种TLSL方法,称为传输无监督的线性子空间学习(TULSL)和转移监督线性子空间学习(TSLSL),并为优化问题提供相应的解决方案。关于多个基准数据集的广泛实验验证了TLSL对跨语料语音情绪识别的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号