首页> 外文会议>2017 IEEE International Joint Conference on Biometrics >In defense of low-level structural features and SVMs for facial attribute classification: Application to detection of eye state, Mouth State, and eyeglasses in the wild
【24h】

In defense of low-level structural features and SVMs for facial attribute classification: Application to detection of eye state, Mouth State, and eyeglasses in the wild

机译:防御低级结构特征和SVM以进行面部属性分类:在野外检测眼睛状态,嘴巴状态和眼镜的应用

获取原文
获取原文并翻译 | 示例

摘要

The current trend in image analysis is to employ automatically detected feature types, such as those obtained using deep-learning techniques. For some applications, however, manually crafted features such as Histogram of Oriented Gradients (HOG) continue to yield better performance in demanding situations. This paper considers both approaches for the problem of facial attribute classification, for images obtained “in the wild.” Attributes of particular interest are eye state (open/closed), mouth state (open/closed), and eyeglasses (present/absent). We present a full face-processing pipeline that employs conventional machine learning techniques, from detection to attribute classification. Experimental results have indicated better performance using RootSIFT with a conventional support-vector machine (SVM) approach, as compared to deep-learning approaches that have been reported in the literature. Our proposed open/closed eye classifier has yielded an accuracy of 99.3% on the CEW dataset, and an accuracy of 98.7% on the ZJU dataset. Similarly, our proposed open/closed mouth classifier has achieved performance similar to deep learning. Also, our proposed presence/absence eyeglasses classifier delivered very good performance, being the best method on LFWA, and second best for the CelebA dataset. The system reported here runs at 30 fps on HD-sized video using a CPU-only implementation.
机译:图像分析的当前趋势是采用自动检测的特征类型,例如使用深度学习技术获得的那些特征类型。但是,对于某些应用程序而言,手动制作的功能(例如,定向梯度直方图(HOG))在苛刻的情况下仍会产生更好的性能。本文针对“在野外”获得的图像,考虑了针对面部属性分类问题的两种方法。特别感兴趣的属性是眼睛状态(打开/关闭),嘴部状态(打开/关闭)和眼镜(当前/不存在)。我们提出了一个全脸处理管道,该管道采用了传统的机器学习技术,从检测到属性分类。与文献中报道的深度学习方法相比,实验结果表明,使用RootSIFT与常规支持向量机(SVM)方法相比,性能更好。我们提出的睁眼/闭眼分类器在CEW数据集上的准确性为99.3%,在ZJU数据集上的准确性为98.7%。类似地,我们提出的张/闭嘴分类器已经实现了与深度学习相似的性能。此外,我们提出的存在/不存在眼镜分类器提供了非常好的性能,这是LFWA上的最佳方法,而CelebA数据集的第二好方法。此处报告的系统使用仅CPU的实现在高清视频上以30 fps的速度运行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号