Multitask Learning of Compact Semantic Codebooks for Context-Aware Scene Modeling

Botao Wang; Hongkai Xiong; Weiyao Lin; Junni Zou; Yuan F. Zheng

Abstract

Over the past few decades, bag-of-features (BoF) models have proven successful in scene classification, object detection, and image segmentation. However, it is also well acknowledged that BoF-based methods are limited by low-level feature encoding and coarse feature pooling. This paper proposes a novel scene classification method that leverages several semantic codebooks, learned in a multitask fashion, for robust feature encoding, and designs a context-aware image representation for efficient feature pooling. Unlike conventional universal codebook learning approaches, the proposed method encodes each class of local features with its own semantic codebook, which captures the distinct distributions of different semantic classes more effectively. Instead of learning each semantic codebook separately, we learn a compact global codebook, of which each semantic codebook is a sparse subset, using a two-stage iterative multitask learning algorithm. The semantic codeword assignment is solved by submodular optimization while the clustering divergence is minimized. Built upon the global and semantic codebooks, a context-aware image representation is further developed that encodes both global and semantic features via contextual quantization, semantic response computation, and semantic pooling. Extensive experiments on various public benchmarks with several popular local features validate the effectiveness of the proposed method.
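
The idea of a shared global codebook with per-class sparse subsets can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the two-stage multitask procedure, the divergence measure, or the submodular objective, so the sketch substitutes plain k-means for the global codebook, a simple greedy selection that reduces each class's quantization error for the subset choice, and hard-assignment histograms for pooling. The function names (`kmeans`, `greedy_subset`, `encode`), the synthetic features, and the sizes (16 global codewords, 4 per class) are all assumptions made for illustration.

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means (stand-in for global codebook learning)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every feature to every codeword.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def greedy_subset(X, codebook, m):
    """Greedily pick m codeword indices that most reduce the
    quantization error of the class features X (assumed heuristic)."""
    d2 = ((X[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    best = np.full(len(X), d2.max())        # error before any pick
    taken = np.zeros(len(codebook), dtype=bool)
    for _ in range(m):
        # Total error if each candidate codeword were added.
        totals = np.minimum(best[:, None], d2).sum(axis=0)
        totals[taken] = np.inf              # never pick a codeword twice
        j = int(totals.argmin())
        taken[j] = True
        best = np.minimum(best, d2[:, j])
    return np.flatnonzero(taken)

def encode(X, codebook, subset=None):
    """Hard-assign features to the nearest codeword (optionally within
    a subset) and return an L1-normalized histogram."""
    idx = np.arange(len(codebook)) if subset is None else np.asarray(subset)
    d2 = ((X[:, None, :] - codebook[idx][None, :, :]) ** 2).sum(axis=2)
    hist = np.bincount(d2.argmin(axis=1), minlength=len(idx)).astype(float)
    return hist / hist.sum()

# Synthetic local features for two hypothetical semantic classes.
rng = np.random.default_rng(0)
feats = {c: rng.normal(loc=3.0 * c, size=(200, 8)) for c in range(2)}

global_cb = kmeans(np.vstack(list(feats.values())), k=16)
semantic = {c: greedy_subset(X, global_cb, m=4) for c, X in feats.items()}

# Image representation: global histogram followed by one histogram per class.
image = feats[1][:50]
rep = np.concatenate(
    [encode(image, global_cb)]
    + [encode(image, global_cb, s) for s in semantic.values()]
)
```

Here each semantic codebook is stored only as a set of indices into the global codebook, which is what keeps the combined model compact. The concatenated vector `rep` is a crude analogue of a representation that carries both global and class-specific information; the contextual quantization and semantic response steps named in the abstract are not modeled.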

Bibliographic record

Source
IEEE Transactions on Image Processing, 2016, No. 11, pp. 5411-5426 (16 pages)
Authors
Botao Wang; Hongkai Xiong; Weiyao Lin; Junni Zou; Yuan F. Zheng
Author affiliations

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China;

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China;

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China;

Key Laboratory of Special Fiber Optics and Optical Access Networks, Shanghai University, Shanghai, China;

Department of Electrical and Computer Engineering, The Ohio State University, Columbus, OH, USA;

Original format: PDF
Language: English
Keywords
Semantics; Image coding; Visualization; Image representation; Histograms; Quantization (signal); Context;
