首页> 外文期刊>Pattern recognition letters >Exploiting textual and visual features for image categorization
【24h】

Exploiting textual and visual features for image categorization

机译:利用文本和视觉功能进行图像分类

获取原文
获取原文并翻译 | 示例
           

摘要

Studies show that refining real-world categories into semantic subcategories contributes to better image modeling and classification. Previous image sub-categorization work relying on labeled images and WordNet's hierarchy is labor-intensive. To tackle this problem, in this work, we extract textual and visual features to automatically select and subsequently classify web images into semantic rich categories. The following two major challenges are well studied: (1) noise in the labels of subcategories derived from the general corpus; (2) noise in the labels of images retrieved from the web. Specifically, we first obtain the semantic refinement subcategories from the text perspective and remove the noise by using the relevance-based approach. To suppress the search error induced noisy images, we then formulate image selection and classifier learning as a multi-instance learning problem and propose to solve the employed problem by the cutting-plane algorithm. The experiments show significant performance gains by using the generated data of our approach on image categorization tasks. The proposed approach also consistently outperforms existing weakly supervised and web-supervised approaches. (C) 2018 Published by Elsevier B.V.
机译:研究表明,将现实世界的类别细化为语义子类别有助于更好地进行图像建模和分类。以前的图像子分类工作依赖于标记的图像,而WordNet的层次结构是劳动密集型的。为了解决这个问题,在这项工作中,我们提取文本和视觉功能以自动选择Web图像,然后将其分类为语义丰富的类别。对以下两个主要挑战进行了深入研究:(1)来自一般语料库的子类别标签中的噪声; (2)从网络上检索到的图像标签中的噪点。具体来说,我们首先从文本角度获得语义细化子类别,然后使用基于相关性的方法消除噪声。为了抑制搜索错误引起的噪点图像,我们将图像选择和分类器学习公式化为多实例学习问题,并提出通过切平面算法解决所采用的问题。通过使用我们针对图像分类任务的方法生成的数据,实验显示出显着的性能提升。所提出的方法还始终优于现有的弱监督和Web监督方法。 (C)2018由Elsevier B.V.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号