International Conference on Emerging Technologies

A Deep Learning based Approach for Precise Video Tagging



Abstract

With the increase in smart devices and the abundance of video content, efficient techniques for the indexing, analysis and retrieval of videos are becoming increasingly desirable. Improved indexing and automated analysis of millions of videos could be accomplished by tagging videos automatically. Many existing methods fail to tag videos precisely because they are unable to capture the video context. The context in a video represents the interactions of objects in a scene and their overall meaning. In this work, we propose a novel approach that integrates a video scene ontology with a CNN (Convolutional Neural Network) for improved video tagging. Our method captures the content of a video by extracting information from individual key frames. The key frames are then fed to a CNN-based deep learning model to train its parameters. The trained model is used to generate tags, and the most frequent tags are used to summarize the input video. The proposed technique is benchmarked on the most widely used dataset of video activities, UCF-101. Our method achieves an overall accuracy of 99.8% with an F1-score of 96.2%.
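For illustration only, below is a minimal Python sketch of the frame-level tagging pipeline the abstract outlines: sample key frames, classify each frame with a CNN, and summarize the video with the most frequent tags. The abstract does not specify the key-frame selection method, the CNN architecture, or the tag vocabulary, so this sketch assumes uniform frame sampling, a torchvision ResNet-18 fine-tuned elsewhere on UCF-101, and a placeholder UCF101_LABELS list; none of these details come from the paper.

# Minimal sketch of the tagging pipeline described in the abstract. Assumptions:
#  - key frames are approximated by uniform sampling (the paper's key-frame
#    selection method is not given in the abstract),
#  - a torchvision ResNet-18 stands in for the paper's CNN; its weights are
#    assumed to have been fine-tuned elsewhere on the 101 UCF-101 classes,
#  - UCF101_LABELS is a hypothetical index-to-tag mapping.
from collections import Counter

import cv2
import torch
import torchvision.transforms as T
from torchvision.models import resnet18

UCF101_LABELS = [f"class_{i}" for i in range(101)]  # placeholder tag vocabulary

preprocess = T.Compose([
    T.ToPILImage(),
    T.Resize(256),
    T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

model = resnet18(num_classes=101)  # load fine-tuned UCF-101 weights here in practice
model.eval()

def extract_key_frames(video_path, every_n=30):
    """Uniformly sample one frame every `every_n` frames as a key-frame proxy."""
    cap = cv2.VideoCapture(video_path)
    frames, idx = [], 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % every_n == 0:
            frames.append(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        idx += 1
    cap.release()
    return frames

def tag_video(video_path, top_k=3):
    """Tag each key frame with the CNN, then keep the most frequent tags."""
    votes = Counter()
    with torch.no_grad():
        for frame in extract_key_frames(video_path):
            logits = model(preprocess(frame).unsqueeze(0))
            votes[UCF101_LABELS[int(logits.argmax(dim=1))]] += 1
    return [tag for tag, _ in votes.most_common(top_k)]

if __name__ == "__main__":
    print(tag_video("example.avi"))

Majority voting over per-frame predictions is one simple way to realize "highly frequent tags summarize the video"; the paper's exact aggregation and its scene-ontology integration are not described in the abstract and are not reproduced here.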
