...
首页> 外文期刊>Multimedia Tools and Applications >Elderly fall detection based on multi-stream deep convolutional networks
【24h】

Elderly fall detection based on multi-stream deep convolutional networks

机译:基于多流深卷积网络的老人跌倒检测

获取原文
获取原文并翻译 | 示例
           

摘要

Fall is the biggest threat to seniors, with significant emotional, physical and financial implications. It is the major cause of serious injuries, disabilities, hospitalizations and even death especially for elderly people living alone. Timely detection could provide immediate medical service to the injured and avoid its harmful consequences. Great number of vision-based techniques has been proposed by installing cameras in several everyday environments. Recently, deep learning has revolutionized these techniques, mostly using convolutional neural networks (CNNs). In this paper, we propose weighted multi-stream deep convolutional neural networks that exploit the rich multimodal data provided by RGB-D cameras. Our method detects automatically fall events and sends a help request to the caregivers. Our contribution is three-fold. We build a new architecture composed of four separate CNN streams, one for each modality. The first modality is based on a single combined RGB and depth image to encode static appearance information. RGB image is used to capture color and texture and depth image deals with illumination variations. In contrast of the first feature that lacks the contextual information about previous and next frames, the second modality characterizes the human shape variations. After background subtraction and person recognition, human silhouette is extracted and stacked to define history of binary motion HBMI. The last two modalities are used to more discriminate the motion information. Stacked amplitude and oriented flow are used in addition to stacked optical flow field to describe respectively the velocity, the direction and the motion displacements. The main motivation behind the use of these multimodal data is to combine complementary information such as motion, shape, RGB and depth appearance to achieve more accurate detection than using only one modality. Our second contribution is the combination of the four streams to generate the final decision for fall detection. We evaluate early and late fusion strategies and we have defined the weight of each modality based on its overall system performance. Weighted score fusion is finally adopted based on our experiments. In the third contribution, transfer learning and data augmentation are applied to increase the amount of training data, avoid over fitting and improve the accuracy. Experiments have been conducted on publicly available standard datasets and demonstrate the effectiveness of the proposed method compared to existing methods.
机译:堕落是对老年人的最大威胁,具有重要的情感,身体和财务影响。这是严重伤害,残疾,住院,甚至死亡的主要原因,特别是对于独居的老年人来说。及时检测可以向受伤的伤害提供立即医疗服务,避免其有害后果。通过在几个日常环境中安装相机,提出了大量的基于视觉技术。最近,深度学习彻底改变了这些技术,主要是使用卷积神经网络(CNNS)。在本文中,我们提出了加权多流深卷积神经网络,该神经网络利用RGB-D相机提供的丰富的多模数据。我们的方法自动检测到事件并向看护人发送帮助请求。我们的贡献是三倍。我们构建一个由四个单独的CNN流组成的新架构,一个用于每个模态。第一模态基于单个组合的RGB和深度图像来编码静态外观信息。 RGB图像用于捕获具有照明变化的颜色和纹理和深度图像。相反,缺少关于上一帧和下帧的上下文信息的第一特征,第二模态表征了人类形状变化。在背景减法和人识别之后,提取和堆叠人的轮廓以定义二进制运动HBMI的历史。最后两个模态用于更加辨别运动信息。除了堆叠的光学流场之外,还使用堆叠的幅度和定向流,以分别用于描述速度,方向和运动位移。使用这些多模式数据背后的主要动机是将诸如运动,形状,RGB和深度外观的互补信息组合,以实现比仅使用一个模态的更准确的检测。我们的第二次贡献是四条流的结合,以产生跌倒检测的最终决定。我们评估早期和晚期融合策略,我们根据其整体系统性能定义了每种方式的重量。基于我们的实验,最终采用了加权分数融合。在第三种贡献中,应用转移学习和数据增强以增加培训数据量,避免拟合并提高准确性。已经在公开的标准数据集上进行了实验,并证明了与现有方法相比该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号