International Conference on Data Mining

Mining Visual and Textual Data for Constructing a Multi-Modal Thesaurus

Abstract

We propose an unsupervised approach to learning associations between continuous-valued attributes from different modalities. These associations are used to construct a multi-modal thesaurus that could serve as a foundation for inter-modality translation and for hybrid navigation and search algorithms. We focus on extracting associations between visual features and textual keywords. Visual features consist of low-level attributes extracted from image content, such as color, texture, and shape. Textual features consist of keywords that provide a description of the images. We assume that a collection of training images is available and that each image is globally annotated by a few keywords. The objective is to extract representative visual profiles that correspond to frequent homogeneous regions and to associate them with keywords. These profiles would be used to build the multi-modal thesaurus. The proposed approach was trained with a large collection of images, and the constructed thesaurus was used to label new images. Initial experiments indicate that we can achieve up to 71.9% relative improvement in captioning accuracy over the state of the art.
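The abstract does not spell out the learning procedure, but the pipeline it describes, clustering region-level visual features into representative profiles and attaching the keywords that co-occur with them, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the use of k-means, the rule that every region inherits its parent image's global keywords, and the voting scheme in `annotate` are all assumptions, and `build_thesaurus` and `annotate` are hypothetical helpers.

```python
import numpy as np
from collections import Counter
from sklearn.cluster import KMeans

def build_thesaurus(images, n_profiles=50, top_k=3):
    # images: list of (region_features, keywords) pairs, where
    # region_features is an (n_regions, d) float array of low-level
    # attributes (color, texture, shape) and keywords is the image's
    # global annotation.
    all_regions = np.vstack([feats for feats, _ in images])
    km = KMeans(n_clusters=n_profiles, n_init=10, random_state=0).fit(all_regions)

    # Every region inherits its parent image's global keywords -- a
    # simplifying assumption, since annotations are image-level only.
    counts = [Counter() for _ in range(n_profiles)]
    for feats, keywords in images:
        for label in km.predict(feats):
            counts[label].update(keywords)

    # One thesaurus entry per visual profile: the keywords that
    # co-occur with that cluster most often.
    entries = [counter.most_common(top_k) for counter in counts]
    return km, entries

def annotate(km, entries, region_features, top_k=3):
    # Label a new image by letting each of its regions vote for the
    # keywords of the visual profile it falls into.
    votes = Counter()
    for label in km.predict(region_features):
        for word, count in entries[label]:
            votes[word] += count
    return [word for word, _ in votes.most_common(top_k)]
```

In this sketch each thesaurus entry pairs a visual profile (a cluster of homogeneous regions) with its most frequent keywords, which is the minimal structure needed to translate between the visual and textual modalities in either direction.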
