Real-time multimodal ADL recognition using convolution neural networks

Madhuranga Danushka; Madushan Rivindu; Siriwardane Chathuranga; Gunasekera Kutila

首页> 外文期刊>The Visual Computer >Real-time multimodal ADL recognition using convolution neural networks

【24h】

Real-time multimodal ADL recognition using convolution neural networks

机译：使用卷积神经网络的实时多模态ADL识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Activities of daily living (ADLs) are the activities which humans perform every day of their lives. Walking, sleeping, eating, drinking and sleeping are examples for ADLs. Compared to RGB videos, depth video-based activity recognition is less intrusive and eliminates many privacy concerns, which are crucial for applications such as life-logging and ambient assisted living systems. Existing methods rely on handcrafted features for depth video classification and ignore the importance of audio stream. In this paper, we propose an ADL recognition system that relies on both audio and depth modalities. We propose to adopt popular convolutional neural network (CNN) architectures used for RGB video analysis to classify depth videos. The adaption poses two challenges: (1) depth data are much nosier and (2) our depth dataset is much smaller compared RGB video datasets. To tackle those challenges, we extract silhouettes from depth data prior to model training and alter deep networks to be shallower. As per our knowledge, we used CNN to segment silhouettes from depth images and fused depth data with audio data to recognize ADLs for the first time. We further extended the proposed techniques to build a real-time ADL recognition system.

机译：日常生活（ADL）的活动是人类每天都能生活的活动。走路，睡觉，饮食，饮酒和睡眠是ADL的示例。与RGB视频相比，基于深度的视频的活动识别不太侵扰，消除许多隐私问题，这对于寿命验证和环境辅助生活系统等应用至关重要。现有方法依赖于深度视频分类的手工制作功能，忽略音频流的重要性。在本文中，我们提出了一种依赖于音频和深度模态的ADL识别系统。我们建议采用用于RGB视频分析的流行卷积神经网络（CNN）架构来分类深度视频。该适应构成了两个挑战：（1）深度数据很多Nosier和（2）我们的深度数据集比较较小，比较RGB视频数据集。为了解决这些挑战，我们在模拟培训之前从深度数据中提取剪影，并改变深网络较浅。根据我们的知识，我们使用CNN从深度图像和融合深度数据的段剪影，其中音频数据首次识别ADL。我们进一步扩展了所提出的技术来构建实时ADL识别系统。

著录项

来源
《The Visual Computer》 |2021年第6期|1263-1276|共14页
作者
Madhuranga Danushka; Madushan Rivindu; Siriwardane Chathuranga; Gunasekera Kutila;
展开▼
作者单位

Univ Moratuwa Dept Comp Sci & Engn Katubedda Sri Lanka;

Univ Moratuwa Dept Comp Sci & Engn Katubedda Sri Lanka;

Univ Moratuwa Dept Comp Sci & Engn Katubedda Sri Lanka;

Univ Moratuwa Dept Comp Sci & Engn Katubedda Sri Lanka;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Activity recognition; Depth images; Video classification; Data fusion; Silhouette extraction;

机译：活动识别;深度图像;视频分类;数据融合;剪影提取;

相似文献

外文文献
中文文献
专利

1. Segment spatial-temporal representation and cooperative learning of convolution neural networks for multimodal-based action recognition [J] . Ren Ziliang, Zhang Qieshi, Cheng Jun, Neurocomputing . 2021,第Apra14期

机译：段空间 - 时间代表与基于多式联动作识别的卷积神经网络的合作学习
2. Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers [J] . Tiong Leslie Ching Ow, Kim Seong Tae, Ro Yong Man Image and Vision Computing . 2020,第Octa期

机译：多模式面部生物识别：双流卷积神经网络，具有多种融合层
3. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition [J] . Francisco Javier Ordó?ez, Daniel Roggen Sensors . 2016,第1期

机译：深度卷积和LSTM递归神经网络用于多模式可穿戴活动识别
4. Method for Multimodal Recognition of One-Handed Sign Language Gestures Through 3D Convolution and LSTM Neural Networks [C] . Ildar Kagirov, Dmitry Ryumin, Alexandr Axyonov International Conference on Speech and Computer . 2019

机译：3D卷积和LSTM神经网络的单手手势手势多模态识别方法
5. A Scalable and Low Power Deep Convolutional Neural Network for Multimodal Data Classification in Embedded Real-Time Systems [D] . Jafari, Ali. 2017

机译：用于嵌入式实时系统中的多模式数据分类的可扩展和低功耗的深卷积神经网络
6. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition [O] . Francisco Javier Ordóñez, Daniel Roggen 2016

机译：深度卷积和LSTM递归神经网络用于多模式可穿戴活动识别
7. RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A Survey [O] . Mingliang Gao, Jun Jiang, Guofeng Zou, 2019

机译：基于RGB-D的对象识别使用多模式卷积神经网络：调查

Real-time multimodal ADL recognition using convolution neural networks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅