Nose, Eyes and Ears: Head Pose Estimation by Locating Facial Keypoints

机译：鼻子，眼睛和耳朵：通过定位面部关键点来估计头姿势

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Monocular head pose estimation requires learning a model that computes the intrinsic Euler angles for pose (yaw, pitch, roll) from an input image of human face. Annotating ground truth head pose angles for images in the wild is difficult and requires ad-hoc fitting procedures (which provides only coarse and approximate annotations). This highlights the need for approaches which can train on data captured in controlled environment and generalize on the images in the wild (with varying appearance and illumination of the face). Most present day deep learning approaches which learn a regression function directly on the input images fail to do so. To this end, we propose to use a higher level representation to regress the head pose while using deep learning architectures. More specifically, we use the uncertainty maps in the form of 2D soft localization heatmap images over five facial key-points, namely left ear, right ear, left eye, right eye and nose, and pass them through an convolutional neural network to regress the head-pose. We show head pose estimation results on two challenging benchmarks BIWI and AFLW and our approach surpasses the state of the art on both the datasets.

机译：单眼头部姿势估计需要学习一个模型，该模型根据人脸的输入图像计算姿势（偏航，俯仰，横滚）的固有欧拉角。为野外图像标注地面真相头部姿势角度非常困难，并且需要临时拟合程序（该过程仅提供粗略和近似的标注）。这突显了对可以训练在受控环境中捕获的数据并概括野外图像（具有变化的外观和面部照明）的方法的需求。如今，大多数直接在输入图像上学习回归函数的深度学习方法都无法做到这一点。为此，我们建议在使用深度学习体系结构时使用更高级别的表示来回归头部姿势。更具体地讲，我们在五个面部关键点（即左耳，右耳，左眼，右眼和鼻子）上使用2D软定位热图图像形式的不确定性图，并将其通过卷积神经网络进行回归。头姿势。我们在两个具有挑战性的基准BIWI和AFLW上显示了头部姿势估计结果，并且我们的方法在这两个数据集上都超过了现有技术。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2019年|1977-1981|共5页
会议地点
作者
Aryaman Gupta; Kalpit Thakkar; Vineet Gandhi; P J Narayanan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
convolutional neural nets; face recognition; image annotation; learning (artificial intelligence); pose estimation; regression analysis;

机译：卷积神经网络;人脸识别;图像标注;学习（人工智能）;姿态估计;回归分析;

相似文献

外文文献
中文文献
专利

1. Locating Nose-Tips and Estimating Head Poses in Images by Tensorposes [J] . Tu J., Fu Y., Huang T. S. IEEE Transactions on Circuits and Systems for Video Technology . 2009,第1期

机译：通过张量定位鼻尖并估计图像中的头部姿势
2. Head, Eyes, Ears, Nose, and Throat Emergencies [J] . Emergency medicine clinics of North America . 2013,第2期

机译：头部，眼睛，耳朵，鼻子和喉咙紧急情况
3. Fast Head Pose Estimation via Rotation-Adaptive Facial Landmark Detection for Video Edge Computation [J] . Wang Weiwei, Chen Xiaoyan, Zheng Shuangwu, Quality Control, Transactions . 2020,第期

机译：通过旋转自适应面部地标检测进行视频边缘计算的快速头姿态估计
4. Nose, Eyes and Ears: Head Pose Estimation by Locating Facial Keypoints [C] . Aryaman Gupta, Kalpit Thakkar, Vineet Gandhi, IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：鼻子，眼睛和耳朵：通过定位面部键点来姿势估计
5. Head pose determination, feature point tracking, and eye gaze estimation for human-computer interaction [D] . Reale, Michael J. 2009

机译：人机交互的头部姿势确定，特征点跟踪和视线估计
6. Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image [O] . Leyuan Liu, Zeran Ke, Jiao Huo, 2021

机译：通过重建3D面部模型和2D图像之间的关键点匹配的头部姿态估计
7. Nose, Eyes and Ears: Head Pose Estimation by Locating Facial Keypoints [O] . Aryaman Gupta, Kalpit Thakkar, Vineet Gandhi, 2019

机译：鼻子，眼睛和耳朵：通过定位面部键点来姿势估计

Nose, Eyes and Ears: Head Pose Estimation by Locating Facial Keypoints

摘要

著录项

相似文献

相关主题

期刊订阅