首页> 外文学位 >Scene reconstruction and visualization from Internet photo collections.
【24h】

Scene reconstruction and visualization from Internet photo collections.

机译:互联网照片集的场景重建和可视化。

获取原文
获取原文并翻译 | 示例

摘要

The Internet is becoming an unprecedented source of visual information, with billions of images instantly accessible through image search engines such as Google Images and Flickr. These include thousands of photographs of virtually every famous place, taken from a multitude of viewpoints, at many different times of day, and under a variety of weather conditions. This thesis addresses the problem of leveraging such photos to create new 3D interfaces for virtually exploring our world.;One key challenge is that recreating 3D scenes from photo collections requires knowing where each photo was taken. This thesis introduces new computer vision techniques that robustly recover such information from photo collections without requiring GPS or other instrumentation. These methods are the first to be demonstrated on Internet imagery, and show that 3D reconstruction techniques can be successfully applied to this rich, largely untapped resource. For this problem scale is a particular concern, as Internet collections can be extremely large. I introduce an efficient reconstruction algorithm that selects a small skeletal set of images as a preprocess. This approach can reduce reconstruction time by an order of magnitude with little or no loss in completeness or accuracy.;A second challenge is to build interfaces that take these reconstructions and provide effective scene visualizations. Towards this end, I describe two new 3D user interfaces. Photo Tourism is a 3D photo browser with new geometric controls for moving between photos. These include zooming in to find details, zooming out for more context, and selecting an image region to find photos of an object. The second interface, Pathfinder, takes advantage of the fact that people tend to take photos of interesting views and along interesting paths. Pathfinder creates navigation controls tailored to each location by analyzing the distribution of photos to discover such characteristic views and paths. These controls make it easy to find and explore the important parts of each scene.;Together these techniques enable the automatic creation of 3D experiences for famous sites. A user simply enters relevant keywords and the system automatically downloads images, reconstructs the site, derives navigation controls, and provides an immersive interface.
机译:互联网正在成为前所未有的视觉信息来源,数十亿的图像可通过图像搜索引擎(例如Google图像和Flickr)立即访问。这些照片包括几千张几乎每个著名景点的照片,这些照片是在一天的许多不同时间,在各种天气条件下从多种角度拍摄的。本文解决了利用此类照片创建新的3D界面来虚拟探索我们的世界的问题。一个主要挑战是,从照片集中重新创建3D场景需要知道每张照片的拍摄地点。本文介绍了新的计算机视觉技术,该技术可从照片集中可靠地恢复此类信息,而无需GPS或其他仪器。这些方法是第一个在Internet影像上展示的方法,它们表明3D重建技术可以成功地应用于这种丰富的,尚未开发的资源。对于这个问题,规模尤其值得关注,因为Internet的馆藏可能非常庞大。我介绍一种有效的重建算法,该算法选择一小幅图像骨架作为预处理。这种方法可以将重建时间减少一个数量级,而完整性或准确性几乎没有损失。第二个挑战是建立进行这些重建并提供有效场景可视化的界面。为此,我描述了两个新的3D用户界面。 Photo Tourism是3D照片浏览器,具有用于在照片之间移动的新几何控件。其中包括放大以查找细节,缩小以获取更多上下文以及选择图像区域以查找对象的照片。第二个界面是“路径查找器”,它利用了人们倾向于沿有趣的路径拍照的事实。探路者通过分析照片的分布来发现适合每个位置的导航控件,以发现这种具有特色的视图和路径。这些控件使查找和探索每个场景的重要部分变得很容易。这些技术一起使自动创建著名景点的3D体验成为可能。用户只需输入相关的关键字,系统就会自动下载图像,重建站点,导出导航控件并提供沉浸式界面。

著录项

  • 作者

    Snavely, Keith N.;

  • 作者单位

    University of Washington.;

  • 授予单位 University of Washington.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 192 p.
  • 总页数 192
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号