Detecting the Starting Frame of Actions in Video


Abstract

In this work, we address the problem of precisely localizing key frames of an action, for example, the precise time that a pitcher releases a baseball, or the precise time that a crowd begins to applaud. Key frame localization is a largely overlooked but important action-recognition problem, for example in neuroscience, where we would like to understand the neural activity that produces the start of a bout of an action. To address this problem, we introduce a novel structured loss function that properly weights the types of errors that matter in such applications: it penalizes extra and missed action start detections more heavily than small misalignments. Our structured loss is based on the best matching between predicted and labeled action starts. We train recurrent neural networks (RNNs) to minimize differentiable approximations of this loss. To evaluate these methods, we introduce the Mouse Reach Dataset, a large, annotated video dataset of mice performing a sequence of actions. The dataset was collected and labeled by experts for the purpose of neuroscience research. On this dataset, we demonstrate that our method outperforms related approaches and baseline methods using an unstructured loss.
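The idea of scoring predictions by a best matching between predicted and labeled action starts can be illustrated with a small sketch. This is not the paper's loss (the abstract states only that the networks minimize differentiable approximations of it); it is a plain, non-differentiable matching cost under assumed conventions: a matched pair costs its frame offset, and every extra or missed start costs a fixed penalty. The function name `matching_cost` and the parameter `miss_penalty` are hypothetical. Because start frames lie on a line and the pair cost is the absolute offset, an optimal matching never needs to cross, so a simple alignment-style dynamic program finds it.

```python
def matching_cost(pred, true, miss_penalty=10.0):
    """Minimum-cost matching between predicted and labeled start frames.

    Assumed cost model (illustrative only, not taken from the paper):
    a matched pair costs |p - t| frames; each unmatched prediction
    (extra detection) or unmatched label (missed detection) costs
    miss_penalty.
    """
    pred, true = sorted(pred), sorted(true)
    n, m = len(pred), len(true)
    # cost[i][j]: best cost using the first i predictions and first j labels
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i * miss_penalty  # all predictions are extras
    for j in range(1, m + 1):
        cost[0][j] = j * miss_penalty  # all labels are missed
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost[i][j] = min(
                cost[i - 1][j - 1] + abs(pred[i - 1] - true[j - 1]),  # match the pair
                cost[i - 1][j] + miss_penalty,  # leave prediction unmatched
                cost[i][j - 1] + miss_penalty,  # leave label unmatched
            )
    return cost[n][m]


if __name__ == "__main__":
    # Two small misalignments (2 and 5 frames) cost less than one missed start.
    print(matching_cost([100, 205], [102, 200]))
    print(matching_cost([100], [102, 200]))
```

With these assumed costs, a prediction set that is slightly misaligned on every start scores better than one that is exact on some starts but misses or adds one, which is the ordering of errors the abstract describes.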

Bibliographic record

Source
IEEE Winter Conference on Applications of Computer Vision, 2020, pp. 478-486 (9 pages)
Authors
Iljung S. Kwak; Jian-Zhong Guo; Adam Hantman; Kristin Branson; David Kriegman
Original format: PDF
Keywords
Mice; Neuroscience; Optimal matching; Neural activity; Recurrent neural networks; Task analysis; Neurons
