首页> 外文会议>Symposium on Application Accelerators in High Performance Computing >Parallelizing Principal Component Analysis for Robust Facial Recognition Using CUDA
【24h】

Parallelizing Principal Component Analysis for Robust Facial Recognition Using CUDA

机译:采用CUDA的强大面部识别并行化主成分分析

获取原文

摘要

Facial recognition techniques are of interest for tracking and identification in densely populated areas where security is an important concern. Traditional recognition techniques have yielded acceptable results with high repeatability but require special conditions such as a voluntary and stationary subject, close proximity, and appropriate lighting. Because no single algorithm yields robust results under uncontrolled conditions, more than one algorithm must be considered. Three popular template-based algorithms involved in facial recognition include Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and Independent Component Analysis (ICA). Since facial recognition algorithms require processor-intensive and complicated matrix calculations, these three algorithms could be improved with hardware that can accelerate these calculations. The PCA algorithm studied in this research is a common mathematical method that has been parallelized for other applications, but not for the purposes of facial recognition using General Purpose Graphics Processing Units (GPGPUs). NVIDIA's CUDA parallel computing architecture is employed to implement the PCA algorithm for the GPGPU device. A C implementation of the PCA algorithm has been optimized specifically for use with CUDA kernels and future parallelization in mind. The results provided in this paper show that the parallel GPGPU implementation outperforms the multithreaded C implementation on a general purpose CPU.
机译:面部识别技术对于在稠密的地区进行跟踪和识别的兴趣感兴趣,安全性是一个重要问题。传统的识别技术具有高可重复性的可接受的结果,但需要特殊条件,例如自愿和静止的主体,靠近和适当的照明。因为在不受控制的条件下没有单次算法产生稳健的结果,因此必须考虑多于一种算法。面部识别中涉及的三种流行的基于模板的算法包括主成分分析(PCA),线性判别分析(LDA)和独立分量分析(ICA)。由于面部识别算法需要处理器密集型和复杂的矩阵计算,因此可以通过可以加速这些计算的硬件来改进这三种算法。本研究中研究的PCA算法是一种常见的数学方法,它已经对其他应用并行化,而不是使用通用图形处理单元(GPGPU)的面部识别的目的。使用NVIDIA的CUDA并行计算架构用于实现GPGPU设备的PCA算法。 PCA算法的C实现已经针对CUDA内核和未来的并行化进行了优化。本文提供的结果表明,并行GPGPU实现优于通用CPU上的多线程C实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号