Discovering objects in images and videos.

机译：发现图像和视频中的对象。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This thesis presents a novel way of scene analysis in images and videos. Traditional scene analysis using object detection involves a lot of human labor for labeling the images, and also has the difficulty of handling a large number of objects categories. Our approach to scene analysis is unsupervised in nature. Given a video, we want to "discover" the objects of interest. No single labeled image is used to pre-train or initialize the system. Still, the system is able to discover the objects of interest. It works on a wide variety of videos and it can discover objects belonging to a large set of different categories. It works in crowded scenes with distracting background pattern and motion. It works in partial occlusions and total removal. The probabilistic framework consists of an appearance model and a motion model. The appearance model exploits the consistency of object parts in appearance across frames. The motion model exploits the motion continuity across frames. Together, they provide appearance and location estimates of the objects of interest. This framework provides a basis for higher level video content analysis tasks.

机译：本文提出了一种新颖的图像和视频场景分析方法。使用对象检测的传统场景分析需要大量的人工来标记图像，并且还具有处理大量对象类别的困难。我们的场景分析方法实际上是不受监督的。给定视频，我们想“发现”感兴趣的对象。没有单个标记的图像用于预训练或初始化系统。该系统仍然能够发现感兴趣的对象。它可以处理各种视频，并且可以发现属于大量不同类别的对象。它可以在拥挤的场景中工作，并具有分散注意力的背景图案和动作。它适用于部分遮挡和完全移除。概率框架由外观模型和运动模型组成。外观模型在整个框架中利用了对象零件在外观上的一致性。运动模型利用了跨帧的运动连续性。它们一起提供了感兴趣对象的外观和位置估计。该框架为更高级别的视频内容分析任务提供了基础。

著录项

作者
Liu, David.;
展开▼
作者单位

Carnegie Mellon University.;

展开▼
授予单位 Carnegie Mellon University.;
学科 Computer Science.
学位 Ph.D.
年度 2008
页码 150 p.
总页数 150
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Boosting image object retrieval and indexing by automatically discovered pseudo-objects [J] . Kuan-Ting Chen, Kuan-Hung Lin, Yin-Hsi Kuo, Journal of visual communication & image representation . 2010,第8期

机译：通过自动发现的伪对象来增强图像对象的检索和索引编制
2. Discovering hierarchical object models from captioned images [J] . Michael Jamieson, Yulia Eskin, Afsaneh Fazly, Computer vision and image understanding . 2012,第7期

机译：从字幕图像中发现分层对象模型
3. Hα DOTS: A CATALOG OF FAINT EMISSION-LINE OBJECTS DISCOVERED IN NARROWBAND IMAGES [J] . Jessica A. Kellar1, John J. Salzer2, Gary Wegner1, The Astrophysical journal . 2012,第6期

机译：Hα点：在窄带图像中发现的微弱发射线对象的目录
4. Learning to discover objects in RGB-D images using correlation clustering [C] . Firman Michael, Thomas Diego, Julier Simon, IEEE/RSJ International Conference on Intelligent Robots and Systems . 2013

机译：学习使用相关性聚类发现RGB-D图像中的对象
5. Feature Extraction in Sequential Multimedia Images: with Applications in Satellite Images and On-line Videos. [D] . Liang, Yu-Li. 2012

机译：顺序多媒体图像中的特征提取：在卫星图像和在线视频中的应用。
6. THINGS: A database of 1,854 object concepts and more than 26,000 naturalistic object images [O] . Martin N. Hebart, Adam H. Dickter, Alexis Kidder, 2012

机译：事物：包含1,854个对象概念和26,000多个自然主义对象图像的数据库
7. Figure 9: Still images from thermal imaging videos. [O] . -1

机译：图9：静止图像来自热成像视频。

Discovering objects in images and videos.

摘要

著录项

相似文献

相关主题

期刊订阅