Topic regression multi-modal Latent Dirichlet Allocation for image annotation

机译：用于图像标注的主题回归多模态潜在狄利克雷分配

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present topic-regression multi-modal Latent Dirich-let Allocation (tr-mmLDA), a novel statistical topic model for the task of image and video annotation. At the heart of our new annotation model lies a novel latent variable regression approach to capture correlations between image or video features and annotation texts. Instead of sharing a set of latent topics between the 2 data modalities as in the formulation of correspondence LDA in [2], our approach introduces a regression module to correlate the 2 sets of topics, which captures more general forms of association and allows the number of topics in the 2 data modalities to be different. We demonstrate the power of tr-mmLDA on 2 standard annotation datasets: a 5000-image subset of COREL and a 2687-image LabelMe dataset. The proposed association model shows improved performance over correspondence LDA as measured by caption perplexity.

机译：我们提出了主题回归多模态潜在狄利克-莱分配（tr-mmLDA），一种用于图像和视频注释任务的新型统计主题模型。我们新注释模型的核心是一种新颖的潜在变量回归方法，可捕获图像或视频特征与注释文本之间的相关性。我们的方法不是像[2]中对应LDA的公式那样在2个数据模式之间共享一组潜在主题，而是引入了一个回归模块来关联这2个主题集，该模块捕获了更一般的关联形式并允许数量2种数据方式中的主题设置有所不同。我们在2个标准注释数据集上演示了tr-mmLDA的功能：COREL的5000个图像子集和2687个图像的LabelMe数据集。所提出的关联模型显示出比对应的LDA更好的性能，该性能通过字幕的困惑度来衡量。

著录项

来源
《2010 IEEE Conference on Computer Vision and Pattern Recognition》|2010年|P.3408-3415|共8页
会议地点
作者
Putthividhy Duangmanee; Attias Hagai T.; Nagarajan Srikantan S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41;
关键词

相似文献

外文文献
中文文献
专利

1. MMDF-LDA: An improved Multi-Modal Latent Dirichlet Allocation model for social image annotation [J] . Liu Zheng, Zhang Caiming, Chen Caixian Expert Systems with Application . 2018,第auga期

机译：MMDF-LDA：用于社交图像注释的改进的多模态潜在狄利克雷分配模型
2. Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation [J] . Lienou M., Maitre H., Datcu M. Geoscience and Remote Sensing Letters, IEEE . 2010,第1期

机译：利用潜在狄利克雷分配的卫星图像语义标注
3. Class-specific Gaussian-multinomial latent Dirichlet allocation for image annotation [J] . Zhiming Qian, Ping Zhong, Runsheng Wang EURASIP journal on advances in signal processing . 2015,第1期

机译：类特定的高斯多项式潜在Dirichlet分配用于图像标注
4. Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation [C] . Duangmanee Putthividhya, Hagai T. Attias, Srikantan S. Nagarajan IEEE Conference on Computer Vision and Pattern Recognition . 2010

机译：主题回归多模态潜在Dirichlet分配用于图像注释
5. Performance of Latent Dirichlet Allocation with Different Topic and Document Structures [D] . Feng, Haotian. 2019

机译：不同主题和文档结构的潜在Dirichlet分配的性能
6. Public discourse and sentiment during the COVID 19 pandemic: Using Latent Dirichlet Allocation for topic modeling on Twitter [O] . Jia Xue, Junxiang Chen, Chen Chen, 2020

机译：Covid 19 Pandemery的公众话语和情绪：在推特上使用潜在的Dirichlet分配主题建模
7. Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation [O] . Duangmanee Putthividhya, Hagai T. Attias, Srikantan S. Nagarajan 2015

机译：用于图像标注的主题回归多模态潜在Dirichlet分配
8. Image Annotation and Topic Extraction Using Super-Word Latent Dirichlet Allocation. [R] . Noel, I. G. 2013

机译：基于super-Word Latent Dirichlet分配的图像标注与主题提取。

Topic regression multi-modal Latent Dirichlet Allocation for image annotation

摘要

著录项

相似文献

相关主题

期刊订阅