首页> 外文会议>International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation >An embedded hardware-efficient architecture for real-time cascade Support Vector Machine classification
【24h】

An embedded hardware-efficient architecture for real-time cascade Support Vector Machine classification

机译:用于实时级联支持向量机分类的嵌入式硬件高效架构

获取原文

摘要

Support Vector Machines (SVMs) are considered as a state-of-the-art classification algorithm yielding high accuracy rates. However, SVMs often require processing a large number of support vectors, making the classification process computationally demanding, especially when considering embedded applications. Cascade SVMs have been proposed in an attempt to speed-up classification times, but improved performance comes at a cost of additional hardware resources. Consequently, in this paper we propose an optimized architecture for cascaded SVM processing, along with a hardware reduction method in order to reduce the overheads from the implementation of additional stages in the cascade, leading to significant resource and power savings for embedded applications. The architecture was implemented on a Virtex 5 FPGA platform and evaluated using face detection as the target application on 640×480 resolution images. Additionally, it was compared against implementations of the same cascade processing architecture but without using the reduction method, and a single parallel SVM classifier. The proposed architecture achieves an average performance of 70 frames-per-second, demonstrating a speed-up of 5× over the single parallel SVM classifier. Furthermore, the hardware reduction method results in the utilization of 43% less hardware resources and a 20% reduction in power, with only 0.7% reduction in classification accuracy.
机译:支持向量机(SVM)被认为是产生高准确率的最新分类算法。但是,SVM通常需要处理大量的支持向量,从而使分类过程在计算上变得非常困难,尤其是在考虑嵌入式应用程序时。已经提出了级联SVM,以试图加快分类时间,但是要提高性能,就要付出额外硬件资源的代价。因此,在本文中,我们提出了一种用于级联SVM处理的优化架构,以及一种硬件缩减方法,以减少级联中附加阶段的实现所带来的开销,从而为嵌入式应用程序节省大量资源和功耗。该架构是在Virtex 5 FPGA平台上实现的,并使用人脸检测作为目标应用对640×480分辨率的图像进行了评估。此外,将其与相同级联处理体系结构的实现方案进行了比较,但未使用归约方法和单个并行SVM分类器。所提出的体系结构实现了每秒70帧的平均性能,表明在单个并行SVM分类器上的速度提高了5倍。此外,硬件缩减方法可减少43%的硬件资源利用率和20%的功耗,而分类精度仅降低0.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号