首页> 外文会议>IEEE International Conference on Systems, Man, and Cybernetics >Separation of the Latent Representations into 'Identity' and 'Expression' without Emotional Labels
【24h】

Separation of the Latent Representations into 'Identity' and 'Expression' without Emotional Labels

机译:没有情绪标签的潜在思想与“身份”和“表达”的分离

获取原文

摘要

Learning semantically disentangled representations is important for various computer vision tasks, such as image generation and classification. Although it is possible to learn an effective representation in supervised settings, there are problems requiring enormous effort focused in data collection and labeling, and the difficulty in labeling continuously changing events like facial expressions is significant. In this paper, we propose a method for separating the latent representation of facial images into identity factors and facial expression factors using the variational autoencoder (VAE) framework. In our method, we only use subject labels to control training, and we do not use information attached to facial expressions like emotional labels. The separation between extracted facial expression factors and identity features is very useful for controlling image generation and for classifying facial expressions. Using this latent representation, we also suggest a new approach for facial expression recognition with a simple clustering method, which is based on Euclidean distance. Our classification method dramatically reduces the cost of labeling. The experimental results show that our method successfully disentangles the representation of facial images and separates the latent representation into identity and facial expression factors. Moreover, in a facial expression recognition task, our approach shows advantages over the baseline method without supervision.
机译:学习语义解除不信而信的表示对于各种计算机愿景任务非常重要,例如图像生成和分类。虽然可以在监督设置中学习有效的表示,但是有需要巨大的努力专注于数据收集和标签的问题,并且标记像面部表情等持续改变事件的难度是显着的。在本文中,我们提出了一种使用变分性AutoEncoder(VAE)框架将面部图像的潜在因子和面部表达因子分离的方法。在我们的方法中,我们只使用主题标签来控制培训,我们不使用与情感标签等面部表情附加的信息。提取的面部表情因子和身份特征之间的分离对于控制图像生成和分类面部表情非常有用。使用这种潜在的表示,我们还建议使用简单的聚类方法来提出一种新的面部表情识别方法,这是基于欧几里德距离的。我们的分类方法显着降低了标签的成本。实验结果表明,我们的方法成功地解除了面部图像的表示,并将潜在的表示分解成身份和面部表情。此外,在面部表情识别任务中,我们的方法显示出在没有监督的基线方法上的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号