AN ENSEMBLE FRAMEWORK OF VOICE-BASED EMOTION RECOGNITION SYSTEM FOR FILMS AND TV PROGRAMS

Abstract

Adding voice-based emotion recognition to artificial intelligence (AI) products can improve the user experience. Most prior research focuses only on speech collected under well-controlled conditions. Conventional approaches may fail when background noise or non-speech fillers are present. In this paper, we propose an ensemble framework that combines several types of audio features. The framework incorporates gender and speaker information through multi-task learning, which allows it to capture as much emotional information as possible. The framework is evaluated on the Multimodal Emotion Challenge (MEC) 2017 corpus, which is close to real-world conditions. The proposed framework outperforms the best baseline system by 29.5% (relative improvement).

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2018 | pp. 5739-6377 | 5 pages
Authors
Fei Tao; Gang Liu; Qingen Zhao
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
multi-task learning; attention model; ensemble framework; deep learning; emotion recognition

Similar literature

1. A 3D-convolutional neural network framework with ensemble learning techniques for multi-modal emotion recognition [J]. Elham S. Salama, Reda A. El-Khoribi, Mahmoud E. Shoman. Egyptian Informatics Journal. 2021, No. 2
2. Recognition of Emotions Using Multichannel EEG Data and DBN-GC-Based Ensemble Deep Learning Framework [J]. Hao Chao, Huilai Zhi, Liang Dong. Computational Intelligence and Neuroscience. 2018, No. 1
3. Recognition of Emotions Using Multichannel EEG Data and DBN-GC-Based Ensemble Deep Learning Framework [J]. Chao Hao, Zhi Huilai, Dong Liang. Computational Intelligence and Neuroscience. 2018, No. Pta3
4. AN ENSEMBLE FRAMEWORK OF VOICE-BASED EMOTION RECOGNITION SYSTEM FOR FILMS AND TV PROGRAMS [C]. Fei Tao, Gang Liu, Qingen Zhao. IEEE International Conference on Acoustics, Speech and Signal Processing. 2018
5. It takes two to tango: International co-production of feature films and TV programs in the People's Republic of China [D]. Zhou, Xiaojuan. 1999
6. Recognition of Emotions Using Multichannel EEG Data and DBN-GC-Based Ensemble Deep Learning Framework [O]. Hao Chao, Huilai Zhi, Liang Dong. 2018
7. An Ensemble Framework of Voice-Based Emotion Recognition System for Films and TV Programs [O]. Fei Tao, Gang Liu, Qingen Zhao. 2018
