Application of Video Scene Semantic Recognition Technology in Smart Video

Qin Lele; Kang Lihua

首页> 外文期刊>Technical Gazette >Application of Video Scene Semantic Recognition Technology in Smart Video

【24h】

Application of Video Scene Semantic Recognition Technology in Smart Video

机译：视频场景语义识别技术在智能视频中的应用

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video behaviour recognition and semantic recognition understanding are important components of intelligent video analytics. Traditionally, human behaviour recognition has met problems of low recognition efficiencies and poor accuracies. For example, most existing behaviour recognition methods use the video frames obtained by even segmentation and fixed sampling as the input, which may lose important information between sampling intervals, fail to identify the key frames of the video segments and make use of the contextual semantics to understand current behaviour. In order to improve the semantic understanding capacity and efficiency of video segments, this paper adopts a 3-layer semantic recognition approach based on key frame extraction. First, it completes the segmentation for video recognition at the bottom layer, extracts the key frames in the video segments, primarily understands basic semantics of the persons’ identifications, behaviours and environment, and then introduces the primarily acquired information into the middle layer for semantic integration, and through the integration of various semantics, adopts the loss function to learn the latent relationship between different modal semantics, to enhance the integrating capacity and the robustness of the character semantic integration, and finally, by overall fine tuning, semantic recognition and adjusting all the parameters of the network, completes the semantic recognition task of the video scenario. This method enjoys higher recognition accuracies based on certain datasets, capable of effectively recognizing the semantics of characters and behaviours in videos. Through practical testing, the adoption of the algorithm integrating key frame extractions with the video scene semantic recognition has improved the recognition accuracy and effect of the video character semantics.

机译：视频行为识别和语义识别理解是智能视频分析的重要组成部分。传统上，人类行为识别遇到识别效率低和准确性差的问题。例如，大多数现有的行为识别方法都使用通过均匀分割和固定采样获得的视频帧作为输入，这可能会丢失采样间隔之间的重要信息，无法识别视频片段的关键帧，并且无法利用上下文语义来实现。了解当前行为。为了提高视频片段的语义理解能力和效率，本文采用了基于关键帧提取的三层语义识别方法。首先，在底层完成视频识别的分割，提取视频片段中的关键帧，首先了解人的身份，行为和环境的基本语义，然后将最初获取的信息引入中间层进行语义集成，并通过各种语义的集成，采用损失函数学习不同模态语义之间的潜在关系，增强字符语义集成的集成能力和鲁棒性，最后通过整体的微调，语义识别和调整网络的所有参数，完成了视频场景的语义识别任务。该方法基于某些数据集享有较高的识别精度，能够有效识别视频中人物的语义和行为。通过实际测试，采用结合关键帧提取和视频场景语义识别的算法，提高了视频字符语义的识别精度和效果。

著录项

来源
《Technical Gazette》 |2018年第5期|共8页
作者
Qin Lele; Kang Lihua;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类一般工业技术;
关键词

相似文献

外文文献
中文文献
专利

1. Semantic-based technologies for video analysis in activity recognition, video surveillance and smart home domains [J] . Ritrovato Pierluigi, Greco Luca Journal of ambient intelligence and humanized computing . 2020,第8期

机译：基于语义的动态识别视频分析技术，视频监控和智能家庭域
2. Mutual Complementarity: Multi-Modal Enhancement Semantic Learning for Micro-Video Scene Recognition [J] . Guo Jie, Nie Xiushan, Yin Yilong Quality Control, Transactions . 2020,第期

机译：互补性：微型视频场景识别的多模态增强语义学习
3. Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos [J] . Dilek Kuecuek, Adnan Yazici Knowledge-Based Systems . 2011,第6期

机译：利用用于自动语义视频索引的信息提取技术及其在土耳其新闻视频中的应用
4. Atomic Scenes for Scalable Traffic Scene Recognition in Monocular Videos [C] . Chao-Yeh Chen, Wongun Choi, Manmohan Chandraker IEEE Conference on Applications of Computer Vision . 2016

机译：单眼视频中可扩展交通场景识别的原子场景
5. Three-dimensional scene estimation from monocular videos with applications in video analysis. [D] . Donate, Arturo. 2011

机译：单眼视频的三维场景估计及其在视频分析中的应用。
6. Real-time Smartphone Activity Classification Using Inertial Sensors—Recognition of Scrolling Typing and Watching Videos While Sitting or Walking [O] . Sijie Zhuo, Lucas Sherlock, Gillian Dobbie, 2020

机译：使用惯性传感器的实时智能手机活动分类-识别坐着或走路时的滚动打字和观看视频
7. Semantic-based technologies for video analysis in activity recognition, video surveillance and smart home domains [O] . Pierluigi Ritrovato, Luca Greco 2020

机译：基于语义的动态识别视频分析技术，视频监控和智能家庭域

Application of Video Scene Semantic Recognition Technology in Smart Video

摘要

著录项

相似文献

相关主题

期刊订阅