...
首页> 外文期刊>ACM Transactions on Embedded Computing Systems >Low Overhead CS-Based Heterogeneous Framework for Big Data Acceleration
【24h】

Low Overhead CS-Based Heterogeneous Framework for Big Data Acceleration

机译:基于头顶CS的基于CS的异构框架,用于大数据加速度

获取原文
获取原文并翻译 | 示例
           

摘要

Big data processing on hardware gained immense interest among the hardware research community to take advantage of fast processing and reconfigurability. Though the computation latency can be reduced using hardware, big data processing cost is dominated by data transfers. In this article, we propose a low overhead framework based on compressive sensing (CS) to reduce data transfers up to 67% without affecting signal quality. CS has two important kernels: "sensing" and "reconstruction." In this article, we focus on CS reconstruction is using orthogonal matching pursuit (OMP) algorithm. We implement the OMP CS reconstruction algorithm on a domain-specific PENC many-core platform and a low-power Jetson TK1 platform consisting of an ARM CPU and a K1 GPU. Detailed performance analysis of OMP algorithm on each platform suggests that the PENC many-core platform has 15x and 18x less energy consumption and 16x and 8x faster reconstruction time as compared to the low-power ARM CPU and K1 GPU, respectively. Furthermore, we implement the proposed CS-based framework on heterogeneous architecture, in which the PENC many-core architecture is used as an "accelerator" and processing is performed on the ARM CPU platform. For demonstration, we integrate the proposed CS-based framework with a hadoop MapReduce platform for a face detection application. The results show that the proposed CS-based framework with the PENC many-core as an accelerator achieves a 26.15% data storage/transfer reduction, with an execution time and energy consumption overhead of 3.7% and 0.002%, respectively, for 5,000 image transfers. Compared to the CS-based framework implementation on the low-power Jetson TK1 ARM CPU+GPU platform, the PENC many-core implementation is 2.3x faster for the image reconstruction part, while achieving 29% higher performance and 34% better energy efficiency for the complete face detection application on the Hadoop MapReduce platform.
机译:硬件的大数据处理在硬件研究界中获得了巨大的兴趣,以利用快速加工和可重新配置性。虽然使用硬件可以减少计算延迟,但大数据处理成本由数据传输主导。在本文中,我们提出了一种基于压缩感测(CS)的低开销框架,以减少高达67%的数据转移而不会影响信号质量。 CS有两个重要的内核:“感应”和“重建”。在本文中,我们专注于CS重建正在使用正交匹配追求(OMP)算法。我们在特定于域的Penc许多核心平台和由ARM CPU和K1 GPU组成的低功耗Jetson TK1平台上实现了OMP CS重建算法。每个平台上的OMP算法的详细性能分析表明,与低功率ARM CPU和K1 GPU分别相比,Penc许多核平台具有15倍和18倍的能耗和18倍。此外,我们在异构架构上实现了基于CS的基于CS的框架,其中Penc许多核心架构用作“加速器”,并且在ARM CPU平台上执行处理。为了演示,我们将所提出的CS系列与Hadoop MakReduce平台集成为面部检测应用程序。结果表明,拟议的基于CS的框架与Penc的核心作为加速器的数量核心达到26.15%的数据存储/转移减少,分别具有3.7%和0.002%的执行时间和能量消耗,为5,000个图像转移。与低功耗Jetson TK1 ARM CPU + GPU平台上的基于CS的框架实现相比,Penc多核的实现对于图像重建部分具有2.3倍,同时性能更高的29%和34%的能源效率Hadoop MapReduce平台上的完整面部检测应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号