Learning from Collective Intelligence: Feature Learning Using Social Images and Tags

Zhang Hanwang; Shang Xindi; Luan Huanbo; Wang Meng; Chua Tat-Seng

首页> 外文期刊>ACM transactions on multimedia computing communications and applications >Learning from Collective Intelligence: Feature Learning Using Social Images and Tags

【24h】

Learning from Collective Intelligence: Feature Learning Using Social Images and Tags

机译：向集体智慧学习：使用社交图像和标签进行特征学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Feature representation for visual content is the key to the progress of many fundamental applications such as annotation and cross-modal retrieval. Although recent advances in deep feature learning offer a promising route towards these tasks, they are limited in application domains where high-quality and large-scale training data are expensive to obtain. In this article, we propose a novel deep feature learning paradigm based on social collective intelligence, which can be acquired from the inexhaustible social multimedia content on the Web, in particular, largely social images and tags. Differing from existing feature learning approaches that rely on high-quality image-label supervision, our weak supervision is acquired by mining the visual-semantic embeddings from noisy, sparse, and diverse social image collections. The resultant image word embedding space can be used to (1) fine-tune deep visual models for low-level feature extractions and (2) seek sparse representations as high-level cross-modal features for both image and text. We offer an easy-to-use implementation for the-proposed paradigm, which is fast and compatible with any state-of-the-art deep architectures. Extensive experiments on several benchmarks demonstrate that the cross-modal features learned by our paradigm significantly outperforms others in various applications such as content based retrieval, classification, and image captioning.

机译：视觉内容的特征表示是许多基本应用（例如注释和跨模式检索）取得进展的关键。尽管深度特征学习的最新进展为实现这些任务提供了一条有希望的途径，但它们在获得高质量和大规模培训数据的成本昂贵的应用领域受到了限制。在本文中，我们提出了一种基于社会集体智慧的新颖的深度特征学习范例，该范例可以从网络上取之不尽的社交多媒体内容（尤其是大部分社交图像和标签）中获取。与现有的依靠高质量图像标签监督的特征学习方法不同，我们的弱监督是通过从嘈杂，稀疏和多样的社会图像集中挖掘视觉语义嵌入而获得的。生成的图像词嵌入空间可用于（1）微调用于低级特征提取的深度视觉模型，以及（2）寻求稀疏表示作为图像和文本的高级交叉模式特征。我们为拟议的范式提供了易于使用的实现，该实现快速且与任何最新的深度架构兼容。在多个基准上进行的广泛实验表明，我们的范式学习到的跨模式功能在各种应用（例如基于内容的检索，分类和图像字幕）中明显优于其他模式。

著录项

来源
《ACM transactions on multimedia computing communications and applications》 |2017年第1期|1.1-1.23|共23页
作者
Zhang Hanwang; Shang Xindi; Luan Huanbo; Wang Meng; Chua Tat-Seng;
展开▼
作者单位

Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore;

Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore;

Tsinghua Univ, Tsinghua Yuan 1, Beijing 100084, Peoples R China;

Hefei Univ Technol, Hefei 230009, Peoples R China;

Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Representation learning; visual-semantic embedding; cross-media analysis;

机译：表征学习;视觉语义嵌入;跨媒体分析;

相似文献

外文文献
中文文献
专利

1. Deriving collective intelligence from reviews on the social Web using a supervised learning approach [J] . Chen Wu Expert Systems with Application . 2011,第10期

机译：使用监督学习方法从社交网络上的评论中得出集体智慧
2. Hospital-Based Nurses’ Perceptions of the Adoption of Web 2.0 Tools for Knowledge Sharing, Learning, Social Interaction and the Production of Collective Intelligence [J] . Adela S.M Lau Journal of medical Internet research . 2011,第4期

机译：医院护士对采用Web 2.0工具进行知识共享，学习，社交互动和集体智慧产生的看法
3. The collective knowledge of social tags: Direct and indirect influences on navigation, learning, and information processing [J] . Ulrike Cress, Christoph Held, Joachim Kimmerle Computers & education . 2013,第1期

机译：社会标签的集体知识：对导航，学习和信息处理的直接和间接影响
4. Ensemble Learning, Social Choice and Collective Intelligence An Experimental Comparison of Aggregation Techniques [C] . Andrea Campagner, Davide Ciucci, Federico Cabitza International Conference on Modeling Decisions for Artificial Intelligence . 2020

机译：整合学习，社会选择和集体智力-聚合技术的实验比较
5. Image features and learning algorithms for biological, generic and social object recognition. [D] . Zhang, Wei. 2009

机译：用于生物，通用和社交对象识别的图像功能和学习算法。
6. Lung nodule malignancy classification using only radiologist-quantified image features as inputs to statistical learning algorithms: probing the Lung Image Database Consortium dataset with two statistical learning methods [O] . Matthew C. Hancock, Jerry F. Magnan 2016

机译：仅使用放射科医生量化的图像特征作为统计学习算法的输入的肺结节恶性分类：使用两种统计学习方法探查肺图像数据库联盟数据集

Learning from Collective Intelligence: Feature Learning Using Social Images and Tags

摘要

著录项

相似文献

相关主题

期刊订阅