首页> 外文学位 >Content-Aware Manipulations for Image and Video Collections.
【24h】

Content-Aware Manipulations for Image and Video Collections.

机译:图像和视频集合的内容感知操作。

获取原文
获取原文并翻译 | 示例

摘要

Digital photography and videography have become ubiquitous. With ever-cheaper and more capable cameras, found in dedicated devices and, increasingly, in multipurpose smartphones and tablets, it is now easier than ever for the casual user to generate their own dense stream of personal multimedia data. With popular photo and video sharing sites, like Flickr, Facebook, and YouTube, users can share their images and video with the world, making for a vast amount of multimedia data stored at home and on the web. This flood of data presents many challenges, particularly for the non-professional, to manage their data, for example, to apply simple photographic adjustments, or to find interesting shots worth keeping among megabytes of data from even a short weekend trip. At the same time, researchers have a unique opportunity to exploit the vast amount of publicly available multimedia data to make graphics tasks easier for the individual.;This dissertation presents work that seeks to address some of these challenges, and, where possible, exploit existing data sets to do so. First, we discuss a general approach that finds images similar to a given input from among a collection of photographs, from which various task-specific properties are transferred to the input. We demonstrate this basic approach in two distinct settings—image restoration and CG image enhancement. Next, we focus on collections of video. We first present a method for efficiently browsing and summarizing collections of related videos. Our approach is based on a simple pairwise video alignment that identities a relevant sequence of video clips that best matches an input video. Finally, we discuss our work on replacing facial performances in video that requires no special hardware and can be used to retarget existing footage to synthesize new performances.
机译:数码摄影和录像已经无处不在。借助专用设备中越来越便宜,功能更强大的相机,以及越来越多的多功能智能手机和平板电脑中的相机,休闲用户现在比以往任何时候都更容易生成自己的密集个人多媒体数据流。通过Flickr,Facebook和YouTube等流行的照片和视频共享网站,用户可以与世界共享他们的图像和视频,从而可以在家庭和网络上存储大量的多媒体数据。大量的数据带来了许多挑战,特别是对于非专业人士而言,要管理其数据,例如应用简单的摄影调整,或者从短短的周末旅行中找到值得保留的兆字节数据,就值得挑战。同时,研究人员有一个独特的机会来利用大量公开可用的多媒体数据,从而使个人的图形任务变得更加容易。本论文提出了旨在解决其中一些挑战并在可能的情况下利用现有技术的工作。数据集来做到这一点。首先,我们讨论一种通用方法,该方法从一组照片中查找与给定输入相似的图像,然后将各种特定于任务的属性从这些图像转移到输入中。我们在两种不同的设置中演示了这种基本方法-图像还原和CG图像增强。接下来,我们重点介绍视频集合。我们首先提出一种有效浏览和汇总相关视频集合的方法。我们的方法基于简单的成对视频对齐方式,该对齐方式标识与输入视频最匹配的视频片段的相关序列。最后,我们讨论了在不需要特殊硬件的情况下替换视频中的面部表演的工作,可以将其用于重新定位现有素材以合成新的表演。

著录项

  • 作者

    Dale, Kevin.;

  • 作者单位

    Harvard University.;

  • 授予单位 Harvard University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 122 p.
  • 总页数 122
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:43:27

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号