International Conference on Artificial Neural Networks

Aggregating Rich Deep Semantic Features for Fine-Grained Place Classification



Abstract

This paper proposes a method that aggregates rich deep semantic features for fine-grained place classification. As is well known, the category of an image depends on its objects and text as well as on its various semantic regions, hierarchical structure, and spatial layout. However, most recently designed fine-grained classification systems ignore this: the complex multi-level semantic structure of images associated with fine-grained classes has not yet been well explored. Therefore, our approach is composed of two modules: a Content Estimator (CNE) and a Context Estimator (CXE). The CNE generates deep content features by encoding global visual cues of an image. The CXE obtains rich context features and consists of three child estimators: a Text Context Estimator (TCE), an Object Context Estimator (OCE), and a Scene Context Estimator (SCE). Given an input image, the TCE encodes text cues to identify word-level semantic information, the OCE extracts high-dimensional features and maps them to object semantic information, and the SCE captures hierarchical structure and spatial layout information by recognizing scene cues. To aggregate rich deep semantic features, we fuse the outputs of the CNE and CXE for fine-grained classification. To the best of our knowledge, this is the first work to leverage text information from an arbitrarily oriented scene text detector for extracting context information. Moreover, our method explores the fusion of semantic features and demonstrates that scene features provide information complementary to the other cues. The proposed approach achieves state-of-the-art performance on a fine-grained classification dataset, reaching 84.3% on Con-Text.
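The two-branch structure the abstract describes can be illustrated with a minimal sketch. This is not the authors' implementation: the estimator bodies are random stand-ins, and the feature dimensions, fusion-by-concatenation, and linear softmax classifier are all assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature dimensions; the paper does not specify them.
# Con-Text contains 28 place categories.
D_CONTENT, D_TEXT, D_OBJECT, D_SCENE, N_CLASSES = 512, 128, 256, 256, 28

def content_estimator(image):
    """CNE: stand-in for a CNN encoding global visual cues."""
    return rng.standard_normal(D_CONTENT)

def text_context_estimator(image):
    """TCE: stand-in for word-level features from a scene text detector."""
    return rng.standard_normal(D_TEXT)

def object_context_estimator(image):
    """OCE: stand-in for object semantic features."""
    return rng.standard_normal(D_OBJECT)

def scene_context_estimator(image):
    """SCE: stand-in for hierarchical structure / spatial layout features."""
    return rng.standard_normal(D_SCENE)

def classify(image, weights):
    """Fuse CNE and CXE outputs, then apply a softmax classifier.

    Concatenation is an assumed fusion scheme for this sketch.
    """
    fused = np.concatenate([
        content_estimator(image),          # CNE branch
        text_context_estimator(image),     # CXE: TCE
        object_context_estimator(image),   # CXE: OCE
        scene_context_estimator(image),    # CXE: SCE
    ])
    logits = weights @ fused
    probs = np.exp(logits - logits.max())  # numerically stable softmax
    return probs / probs.sum()

image = np.zeros((224, 224, 3))  # dummy input image
W = rng.standard_normal((N_CLASSES, D_CONTENT + D_TEXT + D_OBJECT + D_SCENE))
probs = classify(image, W)
```

The point of the sketch is the data flow: the content branch and the three context branches produce independent feature vectors that are joined before a single classification head.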
