Video benchmarks of human action datasets: a review

Singh Tej; Vishwakarma Dinesh Kumar

首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >Video benchmarks of human action datasets: a review

【24h】

Video benchmarks of human action datasets: a review

机译：人类行动数据集的视频基准：审查

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Vision-based Human activity recognition is becoming a trendy area of research due to its wide application such as security and surveillance, human-computer interactions, patients monitoring system, and robotics. In the past two decades, there are several publically available human action, and activity datasets are reported based on modalities, view, actors, actions, and applications. The objective of this survey paper is to outline the different types of video datasets and highlights their merits and demerits under practical considerations. Based on the available information inside the dataset we can categorise these datasets into RGB (Red, Green, and Blue) and RGB-D(depth). The most prominent challenges involved in these datasets are occlusions, illumination variation, view variation, annotation, and fusion of modalities. The key specification of these datasets is discussed such as resolutions, frame rate, actions/actors, background, and application domain. We have also presented the state-of-the-art algorithms in a tabular form that give the best performance on such datasets. In comparison with earlier surveys, our works give a better presentation of datasets on the well-organised comparison, challenges, and latest evaluation technique on existing datasets.

机译：基于视觉的人类活动识别是由于其广泛的应用，如安全性和监测，人机相互作用，患者监测系统和机器人，成为一种时尚的研究领域。在过去的二十年中，有几个公开可用的人类行动，以及基于模态，视图，演员，行动和应用程序的活动数据集。本调查纸的目的是概述不同类型的视频数据集，并在实际考虑下突出显示它们的优点和缺点。基于数据集中的可用信息，我们可以将这些数据集分类为RGB（红色，绿色和蓝色）和RGB-D（深度）。这些数据集中涉及最突出的挑战是闭塞，照明变化，查看变化，注释和方式的融合。讨论这些数据集的关键规范，例如分辨率，帧速率，动作/演员，背景和应用程序域。我们还以表格形式介绍了最先进的算法，其在此类数据集中提供最佳性能。与早期的调查相比，我们的作品更好地介绍了在有组织的比较，挑战和现有数据集上的最新评估技术上的数据集。

著录项

来源
《Artificial Intelligence Review: An International Science and Engineering Journal》 |2019年第2期|共48页
作者
Singh Tej; Vishwakarma Dinesh Kumar;
展开▼
作者单位

Delhi Technol Univ Dept Elect &

Commun Engn New Delhi India;

Delhi Technol Univ Dept Informat Technol New Delhi India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
Human action and activity recognition; Survey; RGB dataset; RGB-depth (RGB-D) dataset;

机译：人类行动和活动识别;调查;RGB数据集;RGB-Deave（RGB-D）数据集;

相似文献

外文文献
中文文献
专利

1. Video benchmarks of human action datasets: a review [J] . Singh Tej, Vishwakarma Dinesh Kumar Artificial Intelligence Review: An International Science and Engineering Journal . 2019,第2期

机译：人类行动数据集的视频基准：审查
2. A survey of video datasets for human action and activity recognition [J] . Jose M. Chaquet, Enrique J. Carmona, Antonio Fernandez-Caballero Computer vision and image understanding . 2013,第6期

机译：用于人类动作和活动识别的视频数据集调查
3. A Benchmark Dataset and Comparison Study for Multi-modal Human Action Analytics [J] . JIAYING LIU, SIJIE SONG, CHUNHUI LIU, ACM transactions on multimedia computing communications and applications . 2020,第2期

机译：多模态人体行动分析的基准数据集和比较研究
4. UMPM benchmark: A multi-person dataset with synchronized video and motion capture data for evaluation of articulated human motion and interaction [C] . van der Aa N.P., Luo X., Giezeman G.J., Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on . 2011

机译：UMPM基准：具有同步视频和运动捕获数据的多人数据集，用于评估人为运动和交互活动
5. Representing and Modeling Human Actions in Videos. [D] . Wang, Limin. 2015

机译：在视频中表示和建模人类行为。
6. C-MHAD: Continuous Multimodal Human Action Dataset of Simultaneous Video and Inertial Sensing [O] . Haoran Wei, Pranav Chopada, Nasser Kehtarnavaz 2020

机译：C-MHAD：同时进行视频和惯性传感的连续多模式人体动作数据集
7. Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection [O] . Barekatain, Mohammadamin, Martí, Miquel, Shih, Hsueh-Fu, 2017

机译：Okutama-action：用于并发人类行动的鸟瞰视频数据集发现
8. Large-scale Benchmark Dataset for Event Recognition in Surveillance Video [R] . Oh, S., Hoogs, A., Perera, A., 2011

机译：用于监视视频中事件识别的大规模基准数据集

Video benchmarks of human action datasets: a review

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅