Exploiting textual and visual features for image categorization

Yao Yazhou; Yang Wankou; Huang Pu; Wang Qiong; Cai Yunfei; Tang Zhenmin

首页> 外文期刊>Pattern recognition letters >Exploiting textual and visual features for image categorization

【24h】

Exploiting textual and visual features for image categorization

机译：利用文本和视觉功能进行图像分类

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Studies show that refining real-world categories into semantic subcategories contributes to better image modeling and classification. Previous image sub-categorization work relying on labeled images and WordNet's hierarchy is labor-intensive. To tackle this problem, in this work, we extract textual and visual features to automatically select and subsequently classify web images into semantic rich categories. The following two major challenges are well studied: (1) noise in the labels of subcategories derived from the general corpus; (2) noise in the labels of images retrieved from the web. Specifically, we first obtain the semantic refinement subcategories from the text perspective and remove the noise by using the relevance-based approach. To suppress the search error induced noisy images, we then formulate image selection and classifier learning as a multi-instance learning problem and propose to solve the employed problem by the cutting-plane algorithm. The experiments show significant performance gains by using the generated data of our approach on image categorization tasks. The proposed approach also consistently outperforms existing weakly supervised and web-supervised approaches. (C) 2018 Published by Elsevier B.V.

机译：研究表明，将现实世界的类别细化为语义子类别有助于更好地进行图像建模和分类。以前的图像子分类工作依赖于标记的图像，而WordNet的层次结构是劳动密集型的。为了解决这个问题，在这项工作中，我们提取文本和视觉功能以自动选择Web图像，然后将其分类为语义丰富的类别。对以下两个主要挑战进行了深入研究：（1）来自一般语料库的子类别标签中的噪声；（2）从网络上检索到的图像标签中的噪点。具体来说，我们首先从文本角度获得语义细化子类别，然后使用基于相关性的方法消除噪声。为了抑制搜索错误引起的噪点图像，我们将图像选择和分类器学习公式化为多实例学习问题，并提出通过切平面算法解决所采用的问题。通过使用我们针对图像分类任务的方法生成的数据，实验显示出显着的性能提升。所提出的方法还始终优于现有的弱监督和Web监督方法。（C）2018由Elsevier B.V.发布

著录项

来源
《Pattern recognition letters》 |2019年第1期|140-145|共6页
作者
Yao Yazhou; Yang Wankou; Huang Pu; Wang Qiong; Cai Yunfei; Tang Zhenmin;
展开▼
作者单位

Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210023, Jiangsu, Peoples R China;

SouthEast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China;

Univ Technol Sydney, Global Big Data Technol Ctr, Sydney, NSW 2007, Australia;

Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
General corpus information; Image categorization; Web-supervised;

机译：通用语料库信息;图像分类;网络监督;

相似文献

外文文献
中文文献
专利

1. A decisive content based image retrieval approach for feature fusion in visual and textual images [J] . Unar Salahuddin, Wang Xingyuan, Wang Chunpeng, Knowledge-Based Systems . 2019,第SEPa1期

机译：基于决定性内容的图像检索方法，用于视觉和文本图像中的特征融合
2. A decisive content based image retrieval approach for feature fusion in visual and textual images [J] . Unar Salahuddin, Wang Xingyuan, Wang Chunpeng, Knowledge-Based Systems . 2019,第Sepa1期

机译：基于果实基于内容的视觉和文本图像特征融合的图像检索方法
3. Large Scale Near-Duplicate Celebrity Web Images Retrieval Using Visual and Textual Features [J] . FengcaiQiao, ChengWang, XinZhang, ScientificWorldJournal . 2013,第3期

机译：使用视觉和文本功能的大规模近重型名人网络图像检索
4. Improving Semantic Scene Categorization by Exploiting Audio-Visual Features [C] . Songhao Zhu, Junchi Yan, Yuncai Liu Image and Graphics, 2009. ICIG '09 . 2010

机译：通过利用视听功能改进语义场景分类
5. The Role of Visual Features in the Affective Categorization of Briefly Presented Naturalistic Scenes [D] . Rhodes, L. Jack. 2019

机译：视觉特征在简单呈现自然主义场景的情感分类中的作用
6. Large Scale Near-Duplicate Celebrity Web Images Retrieval Using Visual and Textual Features [O] . Fengcai Qiao, Cheng Wang, Xin Zhang, 2013

机译：使用视觉和文字功能进行大规模近乎重复的名人Web图像检索
7. Selecting and Categorizing Textual Descriptions of Images in the Context of an Image Indexer's Toolkit [O] . Passonneau Rebecca J., Yano Tae, Klavans Judith L., 2007

机译：在图像索引器工具包的上下文中选择和分类图像的文本描述

Exploiting textual and visual features for image categorization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅