首页> 外文会议>IEEE/RSJ International Conference on Intelligent Robots and Systems >ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly
【24h】

ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly

机译:Etri-Activity3D:用于机器人的大型RGB-D数据集,以识别老年人的日常活动

获取原文

摘要

Deep learning, based on which many modern algorithms operate, is well known to be data-hungry. In particular, the datasets appropriate for the intended application are difficult to obtain. To cope with this situation, we introduce a new dataset called ETRI-Activity3D, focusing on the daily activities of the elderly in robot-view. The major characteristics of the new dataset are as follows: 1) practical action categories that are selected from the close observation of the daily lives of the elderly; 2) realistic data collection, which reflects the robot’s working environment and service situations; and 3) a large-scale dataset that overcomes the limitations of the current 3D activity analysis benchmark datasets. The proposed dataset contains 112,620 samples including RGB videos, depth maps, and skeleton sequences. During the data acquisition, 100 subjects were asked to perform 55 daily activities. Additionally, we propose a novel network called four-stream adaptive CNN (FSA-CNN). The proposed FSA-CNN has three main properties: robustness to spatio-temporal variations, input-adaptive activation function, and extension of the conventional two-stream approach. In the experiment section, we confirmed the superiority of the proposed FSA-CNN using NTU RGB+D and ETRI-Activity3D. Further, the domain difference between both groups of age was verified experimentally. Finally, the extension of FSA-CNN to deal with the multimodal data was investigated.
机译:深度学习的基础上,很多现代的算法操作,是众所周知的是大量数据的。特别地,数据集适合于预期应用是很难获得的。为了应对这种情况,我们引入了一个名为ETRI-Activity3D新的数据集,着眼于老年人的机器人视图中的日常活动。是新的数据集的主要特点如下:即从老年人的日常生活密切观察选择1)用实际行动类别; 2)现实的数据收集,这反映了机器人的工作环境和服务的情况;和3)的大规模数据集,其克服了当前的3D活动分析基准数据集的限制。所提出的数据集包含112620个样品,包括RGB视频,深度图,和骨架序列。在数据采集期间,100名受试者被要求执行55个日常活动。此外,我们提出了一种所谓的四流自适应CNN(FSA-CNN)的新网络。所提出的FSA-CNN有三个主要性能:鲁棒性时空变型中,输入自适应激活功能,和常规的两流方法的扩展。在实验部分,我们证实了提出FSA-CNN的使用NTU RGB + d和ETRI-Activity3D的优越性。此外,年龄这两个群体之间的差异域实验验证。最后,FSA-CNN的扩展来处理多模数据进行了研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号