IEEE Transactions on Cybernetics

Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context



Abstract

Accurately recognizing scene categories with sophisticated spatial configurations is a useful technique in computer vision and intelligent systems, e.g., scene understanding and autonomous driving. Deep recognition models have recently achieved competitive accuracies. Nevertheless, these deep architectures cannot explicitly characterize human visual perception, that is, the sequence of gaze allocation and the subsequent cognitive processes involved in viewing each scene. In this paper, a novel spatially aware aggregation network is proposed for scene categorization, where human gaze behavior is discovered in a semisupervised setting. In particular, as semantically labeling a large quantity of scene images is labor-intensive, a semisupervised and structure-preserved non-negative matrix factorization (NMF) is proposed to detect a set of visually/semantically salient regions in each scene. Afterward, the gaze shifting path (GSP) is engineered to characterize the process by which humans perceive each scene picture. To deeply describe each GSP, a novel spatially aware CNN termed SA-Net is developed. It accepts input regions with various shapes and statistically aggregates all the salient regions along each GSP. Finally, the learned deep GSP features from all scene images are fused into an image kernel, which is subsequently integrated into a kernel SVM to categorize different scenes. Comparative experiments on six scene image sets have shown the advantage of our method.
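To illustrate the decomposition underlying the salient-region detection step, the following is a minimal sketch of plain NMF solved by Lee-Seung multiplicative updates. It is not the paper's semisupervised, structure-preserved variant: the label constraints and structure-preservation terms are omitted, and the input matrix `V` is a stand-in for region feature descriptors.

```python
# Minimal NMF sketch via multiplicative updates (Frobenius objective).
# This illustrates only the basic factorization V ~ W @ H; the paper's
# semisupervised and structure-preserving terms are not modeled here.
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-10, seed=0):
    """Factor a non-negative matrix V (m x n) into W (m x rank) @ H (rank x n)."""
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(n_iter):
        # Lee-Seung multiplicative updates keep W and H non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Toy usage: factor a random non-negative "feature" matrix.
V = np.abs(np.random.default_rng(1).random((40, 30)))
W, H = nmf(V, rank=5)
err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
```

In the paper's pipeline, the factorization's basis/encoding would be further constrained by the available labels so that the recovered components correspond to salient regions; here the update rule only demonstrates the non-negativity-preserving optimization.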
