...
首页> 外文期刊>The Visual Computer >Real-time multimodal ADL recognition using convolution neural networks
【24h】

Real-time multimodal ADL recognition using convolution neural networks

机译:使用卷积神经网络的实时多模态ADL识别

获取原文
获取原文并翻译 | 示例

摘要

Activities of daily living (ADLs) are the activities which humans perform every day of their lives. Walking, sleeping, eating, drinking and sleeping are examples for ADLs. Compared to RGB videos, depth video-based activity recognition is less intrusive and eliminates many privacy concerns, which are crucial for applications such as life-logging and ambient assisted living systems. Existing methods rely on handcrafted features for depth video classification and ignore the importance of audio stream. In this paper, we propose an ADL recognition system that relies on both audio and depth modalities. We propose to adopt popular convolutional neural network (CNN) architectures used for RGB video analysis to classify depth videos. The adaption poses two challenges: (1) depth data are much nosier and (2) our depth dataset is much smaller compared RGB video datasets. To tackle those challenges, we extract silhouettes from depth data prior to model training and alter deep networks to be shallower. As per our knowledge, we used CNN to segment silhouettes from depth images and fused depth data with audio data to recognize ADLs for the first time. We further extended the proposed techniques to build a real-time ADL recognition system.
机译:日常生活(ADL)的活动是人类每天都能生活的活动。走路,睡觉,饮食,饮酒和睡眠是ADL的示例。与RGB视频相比,基于深度的视频的活动识别不太侵扰,消除许多隐私问题,这对于寿命验证和环境辅助生活系统等应用至关重要。现有方法依赖于深度视频分类的手工制作功能,忽略音频流的重要性。在本文中,我们提出了一种依赖于音频和深度模态的ADL识别系统。我们建议采用用于RGB视频分析的流行卷积神经网络(CNN)架构来分类深度视频。该适应构成了两个挑战:(1)深度数据很多Nosier和(2)我们的深度数据集比较较小,比较RGB视频数据集。为了解决这些挑战,我们在模拟培训之前从深度数据中提取剪影,并改变深网络较浅。根据我们的知识,我们使用CNN从深度图像和融合深度数据的段剪影,其中音频数据首次识别ADL。我们进一步扩展了所提出的技术来构建实时ADL识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号