首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Spatio-Temporal Action Detection in Untrimmed Videos by Using Multimodal Features and Region Proposals

【2h】

Spatio-Temporal Action Detection in Untrimmed Videos by Using Multimodal Features and Region Proposals

机译：利用多峰特征和区域提议检测未修剪视频中的时空行为

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a novel deep neural network model for solving the spatio-temporal-action-detection problem, by localizing all multiple-action regions and classifying the corresponding actions in an untrimmed video. The proposed model uses a spatio-temporal region proposal method to effectively detect multiple-action regions. First, in the temporal region proposal, anchor boxes were generated by targeting regions expected to potentially contain actions. Unlike the conventional temporal region proposal methods, the proposed method uses a complementary two-stage method to effectively detect the temporal regions of the respective actions occurring asynchronously. In addition, to detect a principal agent performing an action among the people appearing in a video, the spatial region proposal process was used. Further, coarse-level features contain comprehensive information of the whole video and have been frequently used in conventional action-detection studies. However, they cannot provide detailed information of each person performing an action in a video. In order to overcome the limitation of coarse-level features, the proposed model additionally learns fine-level features from the proposed action tubes in the video. Various experiments conducted using the LIRIS-HARL and UCF-10 datasets confirm the high performance and effectiveness of the proposed deep neural network model.

机译：本文提出了一种新颖的深度神经网络模型，通过定位所有多个动作区域并对未修剪视频中的相应动作进行分类来解决时空动作检测问题。所提出的模型使用时空区域提议方法来有效地检测多动作区域。首先，在临时区域提议中，锚定框是通过定位预期可能包含动作的区域生成的。与常规的时间区域提议方法不同，该提议的方法使用互补的两阶段方法来有效地检测异步发生的各个动作的时间区域。另外，为了检测在视频中出现的人中执行动作的委托人，使用了空间区域提议处理。此外，粗略特征包含整个视频的全面信息，并已在常规的动作检测研究中频繁使用。但是，他们无法提供每个人在视频中执行操作的详细信息。为了克服粗糙特征的限制，提出的模型还从视频中提出的动作管中学习了精细特征。使用LIRIS-HARL和UCF-10数据集进行的各种实验证实了所提出的深度神经网络模型的高性能和有效性。

著录项

期刊名称 Sensors (Basel Switzerland)
作者
Yeongtaek Song; Incheol Kim;
展开▼
作者单位

展开▼
年(卷),期 2019(19),5
年度 2019
页码 1085
总页数 19
原文格式 PDF
正文语种
中图分类
关键词
video action detection region proposal spatio-temporal action detection recurrent neural network;

机译：视频动作检测;区域提议;时空动作检测;递归神经网络;

相似文献

外文文献
中文文献
专利

1. Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation [J] . Le Wang, Xuhuan Duan, Qilin Zhang, Sensors . 2018,第5期

机译：Segment-Tube：具有按帧分割的未修剪视频中的时空行为本地化
2. Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol [J] . Baptista-Rios Marcos, Lopez-Sastre Roberto J., Caba Heilbron Fabian, Quality Control, Transactions . 2020,第期

机译：重新思考在未限制视频中的在线行动检测：一种小说在线评估协议
3. A two-stage temporal proposal network for precise action localization in untrimmed video [J] . Wang Fei, Wang Guorui, Du Yuxuan, International journal of machine learning and cybernetics . 2021,第8期

机译：一个两阶段时间建议网络，用于未经监测视频中的精确行动定位
4. A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos [C] . Joshua Gleason, Rajeev Ranjan, Steven Schwarcz, IEEE Winter Conference on Applications of Computer Vision . 2019

机译：基于建议的未修剪视频时空动作检测解决方案
5. Generating Temporal Action Proposals in Long Untrimmed Videos [D] . Vaishnavi, Pratik 2018

机译：在未修剪的长视频中生成时间动作建议
6. Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation [O] . Le Wang, Xuhuan Duan, Qilin Zhang, 2018

机译：Segment-Tube：具有按帧分割的未修剪视频中的时空行为本地化
7. A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos [O] . Joshua Gleason, Rajeev Ranjan, Steven Schwarcz, 2019

机译：基于提案的外部视频动作检测解决方案
8. Keypoint Density-Based Region Proposal for Fine-Grained Object Detection and Classification Using Regions with Convolutional Neural Network Features. [R] . Turner, J. T., Gupta, K., Morris, B., 2015

机译：基于关键点密度的区域提议，用于使用具有卷积神经网络特征的区域进行细粒度目标检测和分类。

Spatio-Temporal Action Detection in Untrimmed Videos by Using Multimodal Features and Region Proposals

摘要

著录项

相似文献

相关主题

期刊订阅