首页> 外文期刊>IEEE Transactions on Image Processing >Multitask Learning of Compact Semantic Codebooks for Context-Aware Scene Modeling
【24h】

Multitask Learning of Compact Semantic Codebooks for Context-Aware Scene Modeling

机译:紧凑型语义代码本的多任务学习,用于上下文感知场景建模

获取原文
获取原文并翻译 | 示例
       

摘要

In the past few decades, we have witnessed the success of bag-of-features (BoF) models in scene classification, object detection, and image segmentation. Whereas it is also well acknowledged that the limitation of BoF-based methods lies in the low-level feature encoding and coarse feature pooling. This paper proposes a novel scene classification method, which leverages several semantic codebooks learned in a multitask fashion for robust feature encoding, and designs a context-aware image representation for efficient feature pooling. Apart from conventional universal codebook learning approaches, the proposed method encodes each class of local features with a unique semantic codebook, which captures the distinct distribution of different semantic classes more effectively. Instead of learning each semantic codebook separately, we learn a compact global codebook, of which each semantic codebook is a sparse subset, with a two-stage iterative multitask learning algorithm. While minimizing the clustering divergence, the semantic codeword assignment is solved by submodular optimization simultaneously. Built upon the global and semantic codebooks, a context-aware image representation is further developed to encode both global and semantic features in image representation via contextual quantization, semantic response computation, and semantic pooling. Extensive experiments have been conducted to validate the effectiveness of the proposed method on various public benchmarks with several popular local features.
机译:在过去的几十年中,我们目睹了功能包(BoF)模型在场景分类,对象检测和图像分割方面的成功。众所周知,基于BoF的方法的局限性在于低级特征编码和粗略特征池。本文提出了一种新颖的场景分类方法,该方法利用以多任务方式学习的几种语义代码本进行鲁棒的特征编码,并设计一种上下文感知的图像表示以进行有效的特征池化。除了传统的通用码本学习方法外,所提出的方法还使用唯一的语义码本对每类局部特征进行编码,从而更有效地捕获了不同语义类的独特分布。我们没有学习单独的每个语义代码簿,而是学习了一个紧凑的全局代码簿,其中的每个语义代码簿都是一个稀疏子集,它具有两阶段的迭代多任务学习算法。在最小化聚类差异的同时,通过子模优化同时解决了语义代码字分配。基于全局和语义码本,可以进一步开发上下文感知图像表示,以通过上下文量化,语义响应计算和语义池对图像表示中的全局和语义特征进行编码。已经进行了广泛的实验,以验证该方法在具有几种流行的本地特征的各种公共基准上的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号