Learning semantically disentangled representations is important for various computer vision tasks, such as image generation and classification. Although effective representations can be learned in supervised settings, many problems require enormous effort for data collection and labeling, and continuously changing events such as facial expressions are particularly difficult to label. In this paper, we propose a method for separating the latent representation of facial images into identity factors and facial expression factors within the variational autoencoder (VAE) framework. Our method uses only subject labels to control training and does not rely on expression-related annotations such as emotion labels. Separating the extracted facial expression factors from identity features is useful both for controlling image generation and for classifying facial expressions. Building on this latent representation, we also propose a new approach to facial expression recognition based on a simple clustering method that uses Euclidean distance, which dramatically reduces labeling cost. Experimental results show that our method successfully disentangles the representation of facial images, separating it into identity and facial expression factors. Moreover, on a facial expression recognition task, our approach shows advantages over the baseline method without using expression supervision.