首页> 外文会议>2018 First Asian Conference on Affective Computing and Intelligent Interaction >Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks
【24h】

Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks

机译:梯形网络中半监督学习的语音情感识别

获取原文
获取原文并翻译 | 示例

摘要

As a major branch of speech processing, speech emotion recognition has drawn much attention of researchers. Prior works have proposed a variety of models and feature sets for training a system. In this paper, we propose to use semi-supervised learning with ladder networks to generate robust feature representation for speech emotion recognition. In our method, the input of ladder network is the normalized static acoustic features and is mapped to high level hidden representations. The model is trained to simultaneously minimize the sum of supervised and unsupervised cost functions by back-propagation. The extracted hidden representations are used as emotional features in SVM model for speech emotion recognition. The experimental results, performed on IEMOCAP database, show 2.6% higher performance than denoising auto-encoder, and 5.3% than the static acoustic features.
机译:语音情感识别作为语音处理的主要分支,引起了研究者的广泛关注。先前的工作提出了用于训练系统的各种模型和特征集。在本文中,我们建议使用带有梯形网络的半监督学习来生成用于语音情感识别的鲁棒特征表示。在我们的方法中,梯形网络的输入是归一化的静态声学特征,并映射到高级隐藏表示。对模型进行训练,以通过反向传播同时最小化监督和非监督成本函数的总和。提取的隐藏表示用作SVM模型中的情感特征,以进行语音情感识别。在IEMOCAP数据库上执行的实验结果显示,比降噪自动编码器的性能高2.6%,比静态声学功能的性能高5.3%。

著录项

  • 来源
  • 会议地点 Beijing(CN)
  • 作者单位

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

    National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Feature extraction; Emotion recognition; Speech recognition; Acoustics; Decoding; Automation; Supervised learning;

    机译:特征提取;情感识别;语音识别;声学;解码;自动化;监督学习;;
  • 入库时间 2022-08-26 14:26:36

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号