首页> 外文会议>European conference on computer vision >Deep Bimodal Regression for Apparent Personality Analysis
【24h】

Deep Bimodal Regression for Apparent Personality Analysis

机译:深度双峰回归表观人格分析

获取原文

摘要

Apparent personality analysis from short video sequences is a challenging problem in computer vision and multimedia research. In order to capture rich information from both the visual and audio modality of videos, we propose the Deep Bimodal Regression (DBR) framework. In DBR, for the visual modality, we modify the traditional convolutional neural networks for exploiting important visual cues. In addition, taking into account the model efficiency, we extract audio representations and build the linear regressor for the audio modality. For combining the complementary information from the two modalities, we ensemble these predicted regression scores by both early fusion and late fusion. Finally, based on the proposed framework, we come up with a solution for the Apparent Personality Analysis competition track in the ChaLearn Looking at People challenge in association with ECCV 2016. Our DBR is the winner (first place) of this challenge with 86 registered teams.
机译:短视频序列的表观人格分析是计算机视觉和多媒体研究中的一个具有挑战性的问题。为了从视频的视觉和音频模型中捕获丰富的信息,我们提出了深度双峰回归(DBR)框架。在DBR中,对于视觉模态,我们修改传统的卷积神经网络,以利用重要的视觉提示。此外,考虑到模型效率,我们提取音频表示并构建音频模块的线性回归。结合两种方式的互补信息,我们通过早期融合和晚期融合来集合这些预测的回归分数。最后,基于拟议的框架,我们提出了一个解决方案,即在Chalearn中寻找与Eccv联合的Chalearn挑战的明显人格分析竞争轨道。我们的DBR是这项挑战的胜利者(第一名)与86名注册团队有关这一挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号