首页> 外文会议> >Seeing What You#039;re Told: Sentence-Guided Activity Recognition in Video

【24h】

Seeing What You#039;re Told: Sentence-Guided Activity Recognition in Video

机译：明白自己的意思：视频中的句子引导活动识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, providing a medium for top-down and bottom-up integration as well as multi-modal integration between vision and language. We show how the roles played by participants (nouns), their characteristics (adjectives), the actions performed (verbs), the manner of such actions (adverbs), and changing spatial relations between participants (prepositions), in the form of whole-sentence descriptions mediated by a grammar, guides the activity-recognition process. Further, the utility and expressiveness of our framework is demonstrated by performing three separate tasks in the domain of multi-activity video: sentence-guided focus of attention, generation of sentential description, and query-based search, simply by leveraging the framework in different manners.

机译：我们提供了一个系统，该系统演示了事件的构成结构与语言的构成结构如何可以与视频动作识别中的基本聚焦机制相互作用，从而为自上而下和自下而上的集成以及多种-视觉和语言之间的模式整合。我们以整体形式展示参与者（名词）扮演的角色，他们的特征（形容词），所执行的动作（动词），此类动作的方式（副词）以及参与者之间的空间关系（介词）如何变化。语法介导的句子描述指导活动识别过程。此外，通过在多活动视频领域中执行三个单独的任务来证明我们框架的实用性和表达力：简单地通过在不同活动中利用框架，便可以集中注意力进行句子引导的关注点，句子描述的生成和基于查询的搜索举止。

著录项

来源
《》|2014年|732-739|共8页
会议地点
作者
Siddharth N.; Barbu Andrei; Siskind Jeffrey Mark;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. CAN YOU SEE IT?GOOD,SO WE CAN SENSE IT!Pushing the Boundaries of IMU-Based Human Activity Recognition Using Videos [J] . Hyeokhyen Kwon, Catherine Tong, Harish Haresamudram, Mobile Computing and Communications Review . 2021,第2期

机译：你能看到它吗？好，所以我们可以感觉到它！使用视频推动基于IMU的人类活动识别的界限
2. Automated excavators activity recognition and productivity analysis from construction site surveillance videos [J] . Automation in construction . 2020,第Feba期

机译：通过施工现场监控视频自动进行挖掘机活动识别和生产率分析
3. Data-level information enhancement: Motion-patch-based Siamese Convolutional Neural Networks for human activity recognition in videos [J] . Zhang Yujia, Po Lai Man, Liu Mengyang, Expert systems with applications . 2020,第Juna期

机译：数据级信息增强：视频中的运动补丁暹罗卷积神经网络，用于视频中的人类活动识别
4. Seeing What You#039;re Told: Sentence-Guided Activity Recognition in Video [C] . Siddharth N., Barbu Andrei, Siskind Jeffrey Mark IEEE Conference on Computer Vision and Pattern Recognition . 2014

机译：看到你所说的内容：视频中的句子导向活动识别
5. Human Activity Recognition from Egocentric Videos and Robustness Analysis of Deep Neural Networks [D] . Lu, Yantao. 2020

机译：从深神经网络的Egentric视频和鲁棒性分析的人类活动识别
6. Activity Recognition for Ambient Assisted Living with Videos Inertial Units and Ambient Sensors [O] . Caetano Mazzoni Ranieri, Scott MacLeod, Mauro Dragone, 2021

机译：活动识别与视频惯性单元和环境传感器辅助的环境辅助
7. Seeing What You’re Told: Sentence-Guided Activity Recognition In Video [O] . Siddharth Narayanaswamy, Barbu Andrei, Siskind Jeffrey Mark 2014

机译：看你在说什么：视频中的句子引导活动识别

Seeing What You#039;re Told: Sentence-Guided Activity Recognition in Video

摘要

著录项

相似文献

相关主题

期刊订阅