IEEE Transactions on Industrial Informatics

CMIB: Unsupervised Image Object Categorization in Multiple Visual Contexts

Abstract

Object categorization in images is fundamental to various industrial areas, such as automated visual inspection, fast image retrieval, and intelligent surveillance. Most existing methods treat visual features (e.g., the scale-invariant feature transform) as the content information of objects, while regarding image tags as their contextual information. However, such tags can hardly be acquired in completely unsupervised settings, especially when the image volume is too large to annotate. In this article, we propose a novel contextual multivariate information bottleneck (CMIB) method for unsupervised image object categorization in multiple visual contexts. Instead of relying on manually supplied contexts, the CMIB method first automatically generates a set of high-level basic clusterings from multiple global features; these clusterings are defined as visual contexts because they provide overall information about the target images. A data-compression procedure for object category discovery is then formulated, in which the content and the multiple visual contexts are maximally preserved through a "bottleneck." Specifically, two Bayesian networks are built to characterize the relationship between data compression and information preservation. Finally, a novel sequential information-theoretic optimization is proposed to guarantee the convergence of the CMIB objective function. Experimental results on seven real-world benchmark image datasets demonstrate that the CMIB method outperforms state-of-the-art baselines.
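
The abstract does not specify which global descriptors or which base clustering algorithm are used, so the following is only a minimal sketch of the visual-context generation step, assuming generic global feature matrices and k-means as the basic clustering method; the function name build_visual_contexts and the toy descriptors are hypothetical illustrations, not part of the paper.

```python
import numpy as np
from sklearn.cluster import KMeans

def build_visual_contexts(global_features, n_clusters=10, seed=0):
    """Produce one basic clustering (label vector) per global feature matrix.

    Each element of `global_features` is an (n_images, d_i) array holding one
    global descriptor of the whole image collection; the returned label
    vectors play the role of the "visual contexts" described in the abstract.
    """
    contexts = []
    for feats in global_features:
        km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
        contexts.append(km.fit_predict(feats))
    return contexts

# Toy usage with two random stand-ins for global descriptors
# (e.g., a color histogram and a texture statistic) over 100 images.
rng = np.random.default_rng(0)
feature_views = [rng.random((100, 64)), rng.random((100, 32))]
visual_contexts = build_visual_contexts(feature_views, n_clusters=5)
```

The resulting label vectors would then enter the bottleneck stage alongside the content features; in the generic multivariate information bottleneck framework that the abstract's "two Bayesian networks" suggest, the trade-off between compressing the content and preserving the contexts is typically written as a functional of the form L = I^{G_in} − β · I^{G_out}, though the exact CMIB objective is not given in this abstract.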
