Describing Visual Scenes using Transformed Dirichlet Processes

机译：使用变换的Dirichlet进程描述视觉场景

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivated by the problem of learning to detect and recognize objects with minimal supervision, we develop a hierarchical probabilistic model for the spatial structure of visual scenes. In contrast with most existing models, our approach explicitly captures uncertainty in the number of object instances depicted in a given image. Our scene model is based on the transformed Dirichlet process (TDP), a novel extension of the hierarchical DP in which a set of stochastically transformed mixture components are shared between multiple groups of data. For visual scenes, mixture components describe the spatial structure of visual features in an object-centered coordinate frame, while transformations model the object positions in a particular image. Learning and inference in the TDP, which has many potential applications beyond computer vision, is based on an empirically effective Gibbs sampler. Applied to a dataset of partially labeled street scenes, we show that the TDP's inclusion of spatial structure improves detection performance, flexibly exploiting partially labeled training images.

机译：通过学习检测和识别具有最小监督的问题的问题，我们为视觉场景的空间结构开发了一个分层概率模型。与大多数现有模型相比，我们的方法明确地在给定图像中描绘的对象实例的数量中明确地捕获不确定性。我们的场景模型基于变换的Dirichlet过程（TDP），其分层DP的新扩展，其中一组随机转换的混合组分在多组数据之间共享。对于视觉场景，混合组件描述了以环形坐标帧中的视觉特征的空间结构，而变换模拟特定图像中的对象位置。在TDP中的学习和推理，具有超出计算机视觉的许多潜在应用的，基于经验有效的GIBBS采样器。应用于部分标记的街道场景的数据集，我们表明TDP的空间结构包括检测性能，灵活地利用部分标记的训练图像。

著录项

来源
《Annual Conference on Neural Information Processing Systems》|2006年||共8页
会议地点
作者
Erik B. Sudderth; Antonio Torralba; William T. Freeman; Alan S. Willsky;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Describing visual scenes using transformed objects and parts [J] . Sudderth EB, Torralba A, Freeman WT, International Journal of Computer Vision . 2008,第1a3期

机译：使用变换后的对象和零件描述视觉场景
2. Describing Visual Scenes Using Transformed Objects and Parts [J] . Erik B. Sudderth, Antonio Torralba, William T. Freeman, International Journal of Computer Vision . 2008,第1a3期

机译：使用变换的对象和零件描述视觉场景
3. Integrated visual vocabulary in latent Dirichlet allocation-based scene classification for IKONOS image [J] . Kusumaningrum Retno, Wei Hong, Manurung Ruli, Journal of Applied Remote Sensing . 2014,第Null期

机译：基于潜在狄利克雷分配的IKONOS图像场景分类中的集成视觉词汇
4. Describing Visual Scenes using Transformed Dirichlet Processes [C] . Erik B. Sudderth, Antonio Torralba, William T. Freeman, Annual Conference on Neural Information Processing Systems . 2006

机译：使用变换的Dirichlet进程描述视觉场景
5. Scene Classification Using Spatial Pyramid Matching and Hierarchical Dirichlet Processes. [D] . Yin, Haohui. 2010

机译：使用空间金字塔匹配和分级Dirichlet过程进行场景分类。
6. Describing Events: Changes in Eye Movements and Language Production Due to Visual and Conceptual Properties of Scenes [O] . Yulia Esaulova, Martina Penke, Sarah Dolscheid 2005

机译：描述事件：由于场景的视觉和概念特性导致的眼动和语言产生的变化
7. Describing Visual Scenes Using Transformed Objects and Parts [O] . E. B. Sudderth, A. Torralba, W. T. Freeman, 2005

机译：使用变换的对象和零件描述视觉场景

Describing Visual Scenes using Transformed Dirichlet Processes

摘要

著录项

相似文献

相关主题

期刊订阅