
Describing Visual Scenes using Transformed Dirichlet Processes



Abstract

Motivated by the problem of learning to detect and recognize objects with minimal supervision, we develop a hierarchical probabilistic model for the spatial structure of visual scenes. In contrast with most existing models, our approach explicitly captures uncertainty in the number of object instances depicted in a given image. Our scene model is based on the transformed Dirichlet process (TDP), a novel extension of the hierarchical DP in which a set of stochastically transformed mixture components are shared between multiple groups of data. For visual scenes, mixture components describe the spatial structure of visual features in an object-centered coordinate frame, while transformations model the object positions in a particular image. Learning and inference in the TDP, which has many potential applications beyond computer vision, is based on an empirically effective Gibbs sampler. Applied to a dataset of partially labeled street scenes, we show that the TDP's inclusion of spatial structure improves detection performance, flexibly exploiting partially labeled training images.
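The generative idea at the heart of the abstract can be sketched in a few lines: shared mixture components describe feature positions in an object-centered frame, and each image applies its own random transformation (here, a translation) to place object instances before generating features. This is a hypothetical toy simplification for intuition only, not the paper's full nonparametric TDP model or its Gibbs sampler; all names and parameter values below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Shared object-centered mixture components: fixed 2-D mean offsets.
# (In the actual TDP these components are drawn nonparametrically.)
components = [np.array([0.0, 0.0]), np.array([1.0, -1.0])]
component_std = 0.1  # feature noise around each component mean

def sample_image_features(n_objects, n_features_per_object):
    """Generate 2-D feature positions for one image.

    Each object instance gets its own random translation (the
    "transformation"), mapping the shared object-centered components
    into image coordinates.
    """
    features = []
    for _ in range(n_objects):
        translation = rng.uniform(-5.0, 5.0, size=2)  # object position in image
        for _ in range(n_features_per_object):
            k = rng.integers(len(components))    # choose a shared component
            pos = components[k] + translation    # object-centered -> image frame
            pos = pos + rng.normal(0.0, component_std, size=2)  # feature noise
            features.append(pos)
    return np.array(features)

feats = sample_image_features(n_objects=3, n_features_per_object=10)
print(feats.shape)  # (30, 2): 3 object instances x 10 features, 2-D positions
```

Because the number of object instances per image is itself a variable in this generative story, a model of this shape naturally expresses the uncertainty over instance counts that the abstract highlights.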


