首页> 美国卫生研究院文献>other >Online Multi-Modal Robust Non-Negative Dictionary Learning for Visual Tracking
【2h】

Online Multi-Modal Robust Non-Negative Dictionary Learning for Visual Tracking

机译:在线多模态鲁棒非负字典学习的视觉跟踪

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Dictionary learning is a method of acquiring a collection of atoms for subsequent signal representation. Due to its excellent representation ability, dictionary learning has been widely applied in multimedia and computer vision. However, conventional dictionary learning algorithms fail to deal with multi-modal datasets. In this paper, we propose an online multi-modal robust non-negative dictionary learning (OMRNDL) algorithm to overcome this deficiency. Notably, OMRNDL casts visual tracking as a dictionary learning problem under the particle filter framework and captures the intrinsic knowledge about the target from multiple visual modalities, e.g., pixel intensity and texture information. To this end, OMRNDL adaptively learns an individual dictionary, i.e., template, for each modality from available frames, and then represents new particles over all the learned dictionaries by minimizing the fitting loss of data based on M-estimation. The resultant representation coefficient can be viewed as the common semantic representation of particles across multiple modalities, and can be utilized to track the target. OMRNDL incrementally learns the dictionary and the coefficient of each particle by using multiplicative update rules to respectively guarantee their non-negativity constraints. Experimental results on a popular challenging video benchmark validate the effectiveness of OMRNDL for visual tracking in both quantity and quality.
机译:字典学习是一种获取原子集合以用于后续信号表示的方法。由于其出色的表示能力,字典学习已广泛应用于多媒体和计算机视觉。但是,常规的字典学习算法无法处理多模式数据集。在本文中,我们提出了一种在线多模式鲁棒非负字典学习(OMRNDL)算法,以克服这一缺陷。值得注意的是,OMRNDL在粒子过滤器框架下将视觉跟踪转换为字典学习问题,并从多种视觉模式(例如像素强度和纹理信息)中捕获有关目标的固有知识。为此,OMRNDL从可用帧中自适应地为每个模态学习单个字典,即模板,然后通过基于M估计最小化数据的拟合损失来在所有学习的字典上表示新粒子。所得表示系数可以看作是跨多个模态的粒子的常见语义表示,并且可以用来跟踪目标。 OMRNDL通过使用乘性更新规则来增量学习字典和每个粒子的系数,以分别保证它们的非负约束。在流行的具有挑战性的视频基准测试上的实验结果证明了OMRNDL在数量和质量上对视觉跟踪的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号