首页> 外文学位 >Scene Understanding Using Internet Photo Collections.
【24h】

Scene Understanding Using Internet Photo Collections.

机译:使用Internet照片集了解场景。

获取原文
获取原文并翻译 | 示例

摘要

With billions of photos now online, much computer vision research has been devoted to using Internet photo collections for tasks such as visualization, learning object category models, and 3D scene reconstruction. While most of this recent work leverages the sheer quantity of online images, I present several approaches for using the distribution of images (and associated metadata) to extract structured information about 3D scenes. In essence, I use online photo collections as a proxy for human perception in aggregate, treating each photo as a statement about the world and not just a source of visual data;I present three examples of information extraction leveraging the distribution of online photos from the photo-sharing site Flickr. First, I demonstrate the selection of canonical views of objects and scenes via a greedy image clustering algorithm. Second, I show how scenes can be decomposed into individual objects by using a cue based on the field-of-view of large numbers of images. Finally, I extract scene-scale human movement patterns from the distribution of photo sequences. Based on these projects, I demonstrate applications to scene summarization, browsing, image/object tagging, and visualization.;What objects and views do people find interesting? What is an object? How do people move around while exploring a scene? How do people frame their photos? In this thesis, I suggest a new way to answer such questions about the world, and our perception of it, using Internet photo collections.
机译:随着数十亿张照片现在在线上,许多计算机视觉研究已致力于将Internet照片集用于诸如可视化,学习对象类别模型和3D场景重建之类的任务。尽管最近的大部分工作都充分利用了在线图像的数量,但我提出了几种使用图像分布(和相关的元数据)来提取有关3D场景的结构化信息的方法。从本质上讲,我使用在线照片集作为人类感知的整体代理,将每张照片视为关于世界的陈述,而不仅仅是视觉数据的来源;我提供了三个示例的信息提取示例,这些信息利用了来自照片共享网站Flickr。首先,我通过贪婪的图像聚类算法演示了对象和场景的标准视图选择。其次,我展示了如何使用基于大量图像视场的提示将场景分解为单个对象。最后,我从照片序列的分布中提取出场景尺度的人体运动模式。在这些项目的基础上,我演示了场景摘要,浏览,图像/对象标记和可视化的应用。人们发现哪些对象和视图有趣?什么是物体?人们在探索场景时如何四处走动?人们如何构图?在这篇论文中,我提出了一种使用互联网照片集回答有关世界以及我们对世界的看法的新方法。

著录项

  • 作者

    Simon, Ian.;

  • 作者单位

    University of Washington.;

  • 授予单位 University of Washington.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 104 p.
  • 总页数 104
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号