首页> 中文期刊> 《图学学报》 >视频语义上下文标签树及其结构化分析

视频语义上下文标签树及其结构化分析

         

摘要

Video content is strongly associated with time series and has a strong logical structure. Shot semantic is a kind of basic unit for understanding video content. From the point view of user cognition, among shot semantics, there are various context information hidden rather than explicit temporal association, such as logical and structural association. Obviously, it is important to describe these context information in an reasonable manner. Firstly, this paper presents a label tree with context label to represent the structured context as characterization model of video semantic context. Within the label tree, each shot semantic in a shot semantic sequence is taken as a leaf node and all inner nodes with context label is adopted to represent the inter-dependencies among its child nodes. More important, its hierarchical structure, corresponding to the hierarchical model of video content, leads to significant information gain for video content understanding. Furthermore, it is tough to construct a hierarchical video semantic context label tree from the shot semantic sequence, which needs to bridge from sequence space to tree structure space. Then, according to the combined feature of shot semantic sequence and video semantic label tree, this paper uses an SVM-Struct analysis to construct structural function and loss function for the semantic context and implement the construction of video semantic context label tree. The experimental results show that video semantic context label tree has a better characterization ability in many aspects. And SVM-Struct driven analysis ensures the characterization ability of video semantic label tree with high precision, recall and F1 rate.%视频内容具有非常强的时间关联和逻辑结构,镜头语义是视频内容理解的基本单元。从符合人类认识理解视频内容的角度来看,镜头语义之间隐含着时间上、语义上、结构上的多种上下文关联信息。合理地描述这种上下文信息至关重要。为此,首先采用一棵带有上下文标签的标签树作为镜头语义上下文层次结构的表征模型,以序列化的镜头语义序列为底层叶节点,以内节点的上下文标签表征镜头语义间的上下文关联,其树形结构与视频内容层次化表征形式一致,能为视频内容理解提供显著的信息增益。然后,着眼于解决镜头语义从其序列结构向标签树的层次结构转化,采用结构化支持向量机的分析方法,根据镜头语义序列和视频语义上下文标签树的联合特性构造了语义上下文结构化函数和损失函数,实现了镜头语义的结构化分析。实验结果表明,视频语义上下文标签树在时序性、层次性、领域性、逻辑性等方面具有良好的表征能力,而基于结构化支持向量机的结构化分析方法在镜头语义上下文分析的准确率、召回率及 F1值表现良好。

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号