首页> 外文期刊>Information Sciences: An International Journal >Perceptual multi-channel visual feature fusion for scene categorization
【24h】

Perceptual multi-channel visual feature fusion for scene categorization

机译:感知多通道视觉特征融合,用于场景分类

获取原文
获取原文并翻译 | 示例
           

摘要

Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming at discovering the mechanism of humans perceiving visually/semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto the pre-defined semantic space. Thereby, we describe each graphlet by multiple visual features at both low-level and high-level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual clues at both the low-level and the high-level. According to human visual perception, these top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated based on the learned paths from a collection of scene images. And a kernel SVM is employed for calculating the scene categories. Comprehensive experiments on a series of well-known scene image sets shown the competitiveness and robustness of our GSK. We also demonstrated the high consistency of the predicted path with real human gaze shifting path. (C) 2017 Published by Elsevier Inc.
机译:有效地识别来自各种类别的风景是计算机视觉和智能系统中不可或缺的但具有挑战性的技术。在这项工作中,我们提出了一种基于人凝视的新型图像内核,旨在发现人类在风景中感知视觉/语义突出区域的人体的机制。更具体地,我们首先设计一个弱监督的嵌入算法,将本地图像特征(即,在本工作中的图形)投影到预定义的语义空间上。因此,我们在低级和高电平下通过多个视觉功能描述每个石墨。人们普遍承认,人类只参加了景象内的几个地区。因此,我们制定了一种稀疏性约束的石墨簇排名算法,其包括低级和高级的视觉线索。根据人类视觉感知,这些排名级的石斑圈在视觉上或语义上突出。我们顺序地将它们连接到模拟人凝视的路径中。最后,基于来自场景图像集合的学习路径计算所谓的凝视转换内核(GSK)。和内核SVM用于计算场景类别。关于一系列知名场景图像集的综合实验显示了我们的GSK的竞争力和鲁棒性。我们还证明了预测路径与真正的人凝视移位路径的高一致性。 (c)2017年由elsevier公司发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号