Perceptual multi-channel visual feature fusion for scene categorization

Sun Xiao; Liu Zhenguang; Hu Yuxing; Zhang Luming; Zimmermann Roger

首页> 外文期刊>Information Sciences: An International Journal >Perceptual multi-channel visual feature fusion for scene categorization

【24h】

Perceptual multi-channel visual feature fusion for scene categorization

机译：感知多通道视觉特征融合，用于场景分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming at discovering the mechanism of humans perceiving visually/semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto the pre-defined semantic space. Thereby, we describe each graphlet by multiple visual features at both low-level and high-level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual clues at both the low-level and the high-level. According to human visual perception, these top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated based on the learned paths from a collection of scene images. And a kernel SVM is employed for calculating the scene categories. Comprehensive experiments on a series of well-known scene image sets shown the competitiveness and robustness of our GSK. We also demonstrated the high consistency of the predicted path with real human gaze shifting path. (C) 2017 Published by Elsevier Inc.

机译：有效地识别来自各种类别的风景是计算机视觉和智能系统中不可或缺的但具有挑战性的技术。在这项工作中，我们提出了一种基于人凝视的新型图像内核，旨在发现人类在风景中感知视觉/语义突出区域的人体的机制。更具体地，我们首先设计一个弱监督的嵌入算法，将本地图像特征（即，在本工作中的图形）投影到预定义的语义空间上。因此，我们在低级和高电平下通过多个视觉功能描述每个石墨。人们普遍承认，人类只参加了景象内的几个地区。因此，我们制定了一种稀疏性约束的石墨簇排名算法，其包括低级和高级的视觉线索。根据人类视觉感知，这些排名级的石斑圈在视觉上或语义上突出。我们顺序地将它们连接到模拟人凝视的路径中。最后，基于来自场景图像集合的学习路径计算所谓的凝视转换内核（GSK）。和内核SVM用于计算场景类别。关于一系列知名场景图像集的综合实验显示了我们的GSK的竞争力和鲁棒性。我们还证明了预测路径与真正的人凝视移位路径的高一致性。（c）2017年由elsevier公司发布

著录项

来源
《Information Sciences: An International Journal》 |2018年第2018期|共12页
作者
Sun Xiao; Liu Zhenguang; Hu Yuxing; Zhang Luming; Zimmermann Roger;
展开▼
作者单位

Hefei Univ Technol Sch Comp &

Informat Hefei Anhui Peoples R China;

Natl Univ Singapore Sch Comp Singapore Singapore;

Tsinghua Univ Sch Aerosp Engn Beijing Peoples R China;

Hefei Univ Technol Sch Comp &

Informat Hefei Anhui Peoples R China;

Natl Univ Singapore Sch Comp Singapore Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;计算机的应用;信息与知识传播;自动化技术、计算机技术;
关键词
Image kernel; Feature fusion; Scene categoriztion; Perception;

机译：图像内核;特征融合;场景分类;感知;

相似文献

外文文献
中文文献
专利

1. Perceptual multi-channel visual feature fusion for scene categorization [J] . Sun Xiao, Liu Zhenguang, Hu Yuxing, Information Sciences: An International Journal . 2018,第期

机译：感知多通道视觉特征融合，用于场景分类
2. Feature fusion within local region using localized maximum-margin learning for scene categorization [J] . Qin J., Yung N.H.C. Pattern Recognition: The Journal of the Pattern Recognition Society . 2012,第4期

机译：使用局部最大余量学习进行场景分类的局部区域内特征融合
3. Multi-channel biomimetic visual transformation for object feature extraction and recognition of complex scenes [J] . Yu Lingli, Jin Mingyue, Zhou Kaijun Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2020,第3期

机译：对象特征提取和复杂场景识别的多通道仿真视觉变换
4. Gaze Shifting Kernel: Engineering Perceptually-Aware Features for Scene Categorization [C] . Luming Zhang, Richang Hong, Meng Wang Pacific-Rim conference on multimedia . 2015

机译：视线转换内核：工程感知感知功能，用于场景分类
5. The Role of Visual Features in the Affective Categorization of Briefly Presented Naturalistic Scenes [D] . Rhodes, L. Jack. 2019

机译：视觉特征在简单呈现自然主义场景的情感分类中的作用
6. Disentangling the Independent Contributions of Visual and Conceptual Features to the Spatiotemporal Dynamics of Scene Categorization [O] . Michelle R. Greene, Bruce C. Hansen 2020

机译：解开视觉和概念特征的独立贡献对场景分类的时空动态
7. mCENTRIST: A Multi-Channel Feature Generation Mechanism for Scene Categorization [O] . Yang Xiao, Jianxin Wu, Junsong Yuan 2015

机译：mCENTRIsT：场景分类的多通道特征生成机制
8. Perceptual Dimensions of Simulated Scenes Relevant for Visual Low-Altitude Flight [R] . Kleiss, J. A. 1995

机译：与视觉低空飞行相关的模拟场景的感知维度

Perceptual multi-channel visual feature fusion for scene categorization

摘要

著录项

相似文献

相关主题

期刊订阅