首页>
外国专利>
VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING
VIDEO CONCEPT DETECTION USING MULTI-LAYER MULTI-INSTANCE LEARNING
展开▼
机译:多层多实例学习的视频概念检测
展开▼
页面导航
摘要
著录项
相似文献
摘要
Visual concepts contained within a video clip are classified based upon a set of target concepts. The clip is segmented into shots and a multi-layer multi-instance (MLMI) structured metadata representation of each shot is constructed. A set of pre-generated trained models of the target concepts is validated using a set of training shots. An MLMI kernel is recursively generated which models the MLMI structured metadata representation of each shot by comparing prescribed pairs of shots. The MLMI kernel is subsequently utilized to generate a learned objective decision function which learns a classifier for determining if a particular shot (that is not in the set of training shots) contains instances of the target concepts. A regularization framework can also be utilized in conjunction with the MLMI kernel to generate modified learned objective decision functions. The regularization framework introduces explicit constraints which serve to maximize the precision of the classifier.
展开▼