首页> 外国专利> VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING

VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING

机译：多层多实例学习的视频概念检测

页面导航

摘要
著录项
相似文献

摘要

Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.

机译：视频剪辑中包含的视觉概念根据一组目标概念进行分类。剪辑被分割为多个镜头，并构造了每个镜头的多层多实例（MLMI）结构化元数据表示。使用一组训练镜头来验证目标概念的一组预先生成的训练模型。递归生成MLMI内核，该内核通过比较规定的镜头对对每个镜头的MLMI结构化元数据表示进行建模。 MLMI内核随后用于生成学习的目标决策函数，该函数学习分类器，用于确定特定镜头（不在训练镜头集合中）是否包含目标概念的实例。正则化框架也可以与MLMI内核结合使用，以生成修改后的学习目标决策函数。正则化框架引入了显式约束，这些约束用于最大化分类器的精度。

著录项

公开/公告号US2009274434A1

专利类型
公开/公告日2009-11-05

原文格式PDF
申请/专利权人 TAO MEI;XIAN-SHENG HUA;SHIPENG LI;ZHIWEI GU;
展开▼

申请/专利号US20080111202
发明设计人 ZHIWEI GU;SHIPENG LI;TAO MEI;XIAN-SHENG HUA;
展开▼

申请日2008-04-29
分类号G11B27/00;
国家 US
入库时间 2022-08-21 19:33:53

相似文献

专利
外文文献
中文文献