首页> 外国专利> VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING

VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING

机译:多层多实例学习的视频概念检测

摘要

Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.
机译:视频剪辑中包含的视觉概念根据一组目标概念进行分类。剪辑被分割为多个镜头,并构造了每个镜头的多层多实例(MLMI)结构化元数据表示。使用一组训练镜头来验证目标概念的一组预先生成的训练模型。递归生成MLMI内核,该内核通过比较规定的镜头对对每个镜头的MLMI结构化元数据表示进行建模。 MLMI内核随后用于生成学习的目标决策函数,该函数学习分类器,用于确定特定镜头(不在训练镜头集合中)是否包含目标概念的实例。正则化框架也可以与MLMI内核结合使用,以生成修改后的学习目标决策函数。正则化框架引入了显式约束,这些约束用于最大化分类器的精度。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号