首页> 外文会议>International Symposium on Field-Programmable Custom Computing Machines >Accelerator of Stacked Convolutional Independent Subspace Analysis for Deep Learning-Based Action Recognition
【24h】

Accelerator of Stacked Convolutional Independent Subspace Analysis for Deep Learning-Based Action Recognition

机译:基于深度学习的行动识别堆叠卷积独立子空间分析的加速器

获取原文

摘要

Action recognition has been a research challenge in multimedia computing and machine vision. Recent advances in deep learning combined with stacked convolutional Independent Subspace Analysis (ISA) has achieved a better performance superior to all previously published results on several public available data sets. Unfortunately, one major issue in large-scale deployment of this new deep learning-based approach is the unacceptable latency of training with high-dimension data. In this paper, we propose a new hardware accelerator that can reduce the training time substantially for deep learning-based action recognition. Specifically, our proposed approach focuses on accelerating the convolutional stacked ISA algorithm, the core components of the deep learning-based action recognition algorithms. We design parallel pipelines, data parallelisms and look-up table to speed up the algorithm. With an embedded heterogeneous platform consisting of a general purpose processor and a FPGA, we are able to achieve up to 10X speedup for stacked ISA training compared to a software-only implementation.
机译:行动识别是多媒体计算和机器视觉中的研究挑战。深度学习的最新进展与堆叠的卷积独立子空间分析(ISA)相结合,已经取得了更好的性能,以优于几种公共可用数据集的所有出版结果。不幸的是,大规模部署这种基于深度学习的方法的大型问题是具有高维数据的培训的不可接受的延迟。在本文中,我们提出了一种新的硬件加速器,可以基于基于深度学习的动作识别来降低培训时间。具体而言,我们所提出的方法侧重于加速卷积堆积的ISA算法,基于深度学习的动作识别算法的核心组件。我们设计并行管道,数据并行和查找表以加快算法。对于由通用处理器和FPGA组成的嵌入式异构平台,与仅软件实现相比,我们能够实现最多10倍的加速,用于堆叠ISA培训。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号