首页> 外文期刊>IEEE transactions on biomedical circuits and systems >MC-LSTM: Real-Time 3D Human Action Detection System for Intelligent Healthcare Applications
【24h】

MC-LSTM: Real-Time 3D Human Action Detection System for Intelligent Healthcare Applications

机译:MC-LSTM:面向智能医疗应用的实时3D人体动作检测系统

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Due to the movement expressiveness and privacy assurance of human skeleton data, 3D skeleton-based action inference is becoming popular in healthcare applications. These scenarios call for more advanced performance in application-specific algorithms and efficient hardware support. Warnings on health emergencies sensitive to response speed require low latency output and action early detection capabilities. Medical monitoring that works in an always-on edge platform needs the system processor to have extreme energy efficiency. Therefore, in this paper, we propose the MC-LSTM, a functional and versatile 3D skeleton-based action detection system, for the above demands. Our system achieves state-of-the-art accuracy on trimmed and untrimmed cases of general-purpose and medical-specific datasets with early-detection features. Further, the MC-LSTM accelerator supports parallel inference on up to 64 input channels. The implementation on Xilinx ZCU104 reaches a throughput of 18?658 Frames-Per-Second (FPS) and an inference latency of 3.5?ms with the batch size of 64. Accordingly, the power consumption is 3.6?W for the whole FPGA+ARM system, which is 37.8x and 10.4x more energy-efficient than the high-end Titan X GPU and i7-9700 CPU, respectively. Meanwhile, our accelerator also keeps a 4amp;inline-formulaamp;amp;tex-math notation="LaTeX"amp;$sim$amp;/tex-mathamp;amp;/inline-formulaamp;5x energy efficiency advantage against the low-power high-performance Firefly-RK3399 board carrying an ARM Cortex-A72+A53 CPU. We further synthesize an 8-bit quantized version on the same hardware, providing a 48.8 increase in energy efficiency under the same throughput.
机译:由于人体骨骼数据的运动表现力和隐私保证,基于3D骨架的动作推理在医疗保健应用中越来越受欢迎。这些方案要求在特定于应用程序的算法中提供更高级的性能和高效的硬件支持。对响应速度敏感的突发卫生事件预警需要低延迟输出和行动早期检测能力。在始终在线的边缘平台中工作的医疗监测需要系统处理器具有极高的能效。因此,在本文中,我们提出了MC-LSTM,一种功能强大且多功能的基于3D骨架的动作检测系统,以满足上述需求。我们的系统在具有早期检测特征的通用和医学特定数据集的修剪和未修剪病例上实现了最先进的准确性。此外,MC-LSTM 加速器支持在多达 64 个输入通道上进行并行推理。Xilinx ZCU104 上的实现可实现 18?658 帧/秒 (FPS) 的吞吐量和 3.5?ms 的推理延迟,批处理大小为 64。因此,功耗为3.6?整个 FPGA+ARM 系统的 W,能效分别是高端 Titan X GPU 和 i7-9700 CPU 的 37.8 倍和 10.4 倍。同时,我们的加速器还保持了 4inline-formulatex-math notation=“LaTeX”$sim$/tex-math/inline-formula5 倍的能效优势,而搭载 ARM Cortex-A72+A53 CPU 的低功耗高性能 Firefly-RK3399 板。我们在相同的硬件上进一步合成了 8 位量化版本,在相同的吞吐量下,能效提高了 48.8%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号