首页> 外文期刊>Technical Gazette >Application of Video Scene Semantic Recognition Technology in Smart Video
【24h】

Application of Video Scene Semantic Recognition Technology in Smart Video

机译:视频场景语义识别技术在智能视频中的应用

获取原文
           

摘要

Video behaviour recognition and semantic recognition understanding are important components of intelligent video analytics. Traditionally, human behaviour recognition has met problems of low recognition efficiencies and poor accuracies. For example, most existing behaviour recognition methods use the video frames obtained by even segmentation and fixed sampling as the input, which may lose important information between sampling intervals, fail to identify the key frames of the video segments and make use of the contextual semantics to understand current behaviour. In order to improve the semantic understanding capacity and efficiency of video segments, this paper adopts a 3-layer semantic recognition approach based on key frame extraction. First, it completes the segmentation for video recognition at the bottom layer, extracts the key frames in the video segments, primarily understands basic semantics of the persons’ identifications, behaviours and environment, and then introduces the primarily acquired information into the middle layer for semantic integration, and through the integration of various semantics, adopts the loss function to learn the latent relationship between different modal semantics, to enhance the integrating capacity and the robustness of the character semantic integration, and finally, by overall fine tuning, semantic recognition and adjusting all the parameters of the network, completes the semantic recognition task of the video scenario. This method enjoys higher recognition accuracies based on certain datasets, capable of effectively recognizing the semantics of characters and behaviours in videos. Through practical testing, the adoption of the algorithm integrating key frame extractions with the video scene semantic recognition has improved the recognition accuracy and effect of the video character semantics.
机译:视频行为识别和语义识别理解是智能视频分析的重要组成部分。传统上,人类行为识别遇到识别效率低和准确性差的问题。例如,大多数现有的行为识别方法都使用通过均匀分割和固定采样获得的视频帧作为输入,这可能会丢失采样间隔之间的重要信息,无法识别视频片段的关键帧,并且无法利用上下文语义来实现。了解当前行为。为了提高视频片段的语义理解能力和效率,本文采用了基于关键帧提取的三层语义识别方法。首先,在底层完成视频识别的分割,提取视频片段中的关键帧,首先了解人的身份,行为和环境的基本语义,然后将最初获取的信息引入中间层进行语义集成,并通过各种语义的集成,采用损失函数学习不同模态语义之间的潜在关系,增强字符语义集成的集成能力和鲁棒性,最后通过整体的微调,语义识别和调整网络的所有参数,完成了视频场景的语义识别任务。该方法基于某些数据集享有较高的识别精度,能够有效识别视频中人物的语义和行为。通过实际测试,采用结合关键帧提取和视频场景语义识别的算法,提高了视频字符语义的识别精度和效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号