Venue: Pattern Recognition, 2006 (ICPR 2006)

Building a Multi-Modal Thesaurus from Annotated Images



Abstract

We propose an unsupervised approach to learn associations between low-level visual features and keywords. We assume that a collection of images is available and that each image is globally annotated. The objective is to extract representative visual profiles that correspond to frequent homogeneous regions, and to associate them with keywords. These labeled profiles would be used to build a multi-modal thesaurus that could serve as a foundation for hybrid navigation and search algorithms. Our approach has two main steps. First, each image is coarsely segmented into regions, and visual features are extracted from each region. Second, the regions are categorized using a novel algorithm that performs clustering and feature weighting simultaneously. As a result, we obtain clusters of regions that share subsets of relevant features. Representatives from each cluster and their relevant visual and textual features would be used to build a thesaurus. The proposed approach is validated using a collection of 1169 images.


