首页> 外文会议>International Symposium on Field-Programmable Custom Computing Machines >Accelerator of Stacked Convolutional Independent Subspace Analysis for Deep Learning-Based Action Recognition
【24h】

Accelerator of Stacked Convolutional Independent Subspace Analysis for Deep Learning-Based Action Recognition

机译:基于深度学习的动作识别的堆叠卷积独立子空间分析加速器

获取原文

摘要

Action recognition has been a research challenge in multimedia computing and machine vision. Recent advances in deep learning combined with stacked convolutional Independent Subspace Analysis (ISA) has achieved a better performance superior to all previously published results on several public available data sets. Unfortunately, one major issue in large-scale deployment of this new deep learning-based approach is the unacceptable latency of training with high-dimension data. In this paper, we propose a new hardware accelerator that can reduce the training time substantially for deep learning-based action recognition. Specifically, our proposed approach focuses on accelerating the convolutional stacked ISA algorithm, the core components of the deep learning-based action recognition algorithms. We design parallel pipelines, data parallelisms and look-up table to speed up the algorithm. With an embedded heterogeneous platform consisting of a general purpose processor and a FPGA, we are able to achieve up to 10X speedup for stacked ISA training compared to a software-only implementation.
机译:动作识别已成为多媒体计算和机器视觉中的研究挑战。结合堆叠卷积独立子空间分析(ISA)的深度学习方面的最新进展取得了比以前在多个公共可用数据集上发布的所有结果更好的性能。不幸的是,这种基于深度学习的新方法的大规模部署中的一个主要问题是使用高维数据进行训练的延迟不可接受。在本文中,我们提出了一种新的硬件加速器,可以大大减少基于深度学习的动作识别的训练时间。具体来说,我们提出的方法着重于加速卷积堆叠ISA算法,这是基于深度学习的动作识别算法的核心组成部分。我们设计并行管道,数据并行性和查找表以加快算法的速度。借助由通用处理器和FPGA组成的嵌入式异构平台,与纯软件实现相比,我们能够将堆叠式ISA培训的速度提高多达10倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号