International Conference on Pattern Recognition

Building a Multi-Modal Thesaurus from Annotated Images


Abstract

We propose an unsupervised approach to learn associations between low-level visual features and keywords. We assume that a collection of images is available and that each image is globally annotated. The objective is to extract representative visual profiles that correspond to frequent homogeneous regions, and to associate them with keywords. These labeled profiles would be used to build a multi-modal thesaurus that could serve as a foundation for hybrid navigation and search algorithms. Our approach has two main steps. First, each image is coarsely segmented into regions, and visual features are extracted from each region. Second, the regions are categorized using a novel algorithm that performs clustering and feature weighting simultaneously. As a result, we obtain clusters of regions that share subsets of relevant features. Representatives from each cluster and their relevant visual and textual features would be used to build a thesaurus. The proposed approach is validated using a collection of 1169 images.
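The second step above, categorizing regions with an algorithm that clusters and weights features simultaneously, can be sketched as a weighted k-means variant in which each cluster learns its own feature-relevance weights. This is only an illustrative stand-in: the abstract does not give the paper's actual update rules, so the farthest-point initialization, the exponent `q`, and the inverse-variance weight update below are all assumptions.

```python
import numpy as np

def weighted_kmeans(X, k, n_iter=50, q=2.0):
    """Toy sketch of simultaneous clustering and per-cluster feature
    weighting. Features with low within-cluster variance receive
    higher relevance weights, so each cluster ends up defined by the
    subset of features on which its regions agree."""
    n, d = X.shape
    # greedy farthest-point initialization (an assumption, for stability)
    centers = [X[0]]
    for _ in range(1, k):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d2.argmax()])
    centers = np.array(centers, dtype=float)
    weights = np.full((k, d), 1.0 / d)  # per-cluster feature relevance
    labels = np.zeros(n, dtype=int)
    for _ in range(n_iter):
        # assign each region to the cluster with the smallest weighted distance
        dist = np.stack([((weights[j] ** q) * (X - centers[j]) ** 2).sum(axis=1)
                         for j in range(k)])
        labels = dist.argmin(axis=0)
        for j in range(k):
            pts = X[labels == j]
            if len(pts) == 0:
                continue
            centers[j] = pts.mean(axis=0)
            # low within-cluster variance => high feature relevance
            var = ((pts - centers[j]) ** 2).sum(axis=0) + 1e-9
            inv = var ** (-1.0 / (q - 1))
            weights[j] = inv / inv.sum()
    return labels, centers, weights
```

The returned cluster centers would play the role of the representative visual profiles, and the per-cluster weights indicate which visual features are relevant to each profile when pairing it with a keyword.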
