【24h】

Video benchmarks of human action datasets: a review

机译:人类行动数据集的视频基准:审查

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Vision-based Human activity recognition is becoming a trendy area of research due to its wide application such as security and surveillance, human-computer interactions, patients monitoring system, and robotics. In the past two decades, there are several publically available human action, and activity datasets are reported based on modalities, view, actors, actions, and applications. The objective of this survey paper is to outline the different types of video datasets and highlights their merits and demerits under practical considerations. Based on the available information inside the dataset we can categorise these datasets into RGB (Red, Green, and Blue) and RGB-D(depth). The most prominent challenges involved in these datasets are occlusions, illumination variation, view variation, annotation, and fusion of modalities. The key specification of these datasets is discussed such as resolutions, frame rate, actions/actors, background, and application domain. We have also presented the state-of-the-art algorithms in a tabular form that give the best performance on such datasets. In comparison with earlier surveys, our works give a better presentation of datasets on the well-organised comparison, challenges, and latest evaluation technique on existing datasets.
机译:基于视觉的人类活动识别是由于其广泛的应用,如安全性和监测,人机相互作用,患者监测系统和机器人,成为一种时尚的研究领域。在过去的二十年中,有几个公开可用的人类行动,以及基于模态,视图,演员,行动和应用程序的活动数据集。本调查纸的目的是概述不同类型的视频数据集,并在实际考虑下突出显示它们的优点和缺点。基于数据集中的可用信息,我们可以将这些数据集分类为RGB(红色,绿色和蓝色)和RGB-D(深度)。这些数据集中涉及最突出的挑战是闭塞,照明变化,查看变化,注释和方式的融合。讨论这些数据集的关键规范,例如分辨率,帧速率,动作/演员,背景和应用程序域。我们还以表格形式介绍了最先进的算法,其在此类数据集中提供最佳性能。与早期的调查相比,我们的作品更好地介绍了在有组织的比较,挑战和现有数据集上的最新评估技术上的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号