首页> 外文学位 >Learning to Synthesize and Manipulate Natural Images
【24h】

Learning to Synthesize and Manipulate Natural Images

机译:学习合成和处理自然图像

获取原文
获取原文并翻译 | 示例

摘要

Humans are avid consumers of visual content. Every day, people watch videos, play digital games and share photos on social media. However, there is an asymmetry---while everybody is able to consume visual data, only a chosen few are talented enough to effectively express themselves visually. For the rest of us, most attempts at creating or manipulating realistic visual content end up quickly "falling off'' the manifold of natural images. In this thesis, we investigate a number of data-driven approaches for preserving visual realism while creating and manipulating photographs. We use these methods as training wheels for visual content creation. We first propose to model visual realism directly from large-scale natural images. We then define a class of image synthesis and manipulation operations, constraining their outputs to look realistic according to the learned models. The presented methods not only help users easily synthesize more visually appealing photos but also enable new visual effects not possible before this work.;Part I describes discriminative methods for modeling visual realism and photograph aesthetics. Directly training these models requires expensive human judgments. To address this, we adopt active and unsupervised learning methods to reduce annotation costs. We then apply the learned model to various graphics tasks, such as automatically generating image composites and choosing the best-looking portraits from a photo album.;Part II presents approaches that directly model the natural image manifold via generative models and constrain the output of a photo editing tool to lie on this manifold. We build real-time data-driven exploration and editing interfaces based on both simpler image averaging models and more recent deep models.;Part III combines the discriminative learning and generative modeling into an end-to-end image-to-image translation framework, where a network is trained to map inputs (such as user sketches) directly to natural looking results. We present a new algorithm that can learn the translation in the absence of paired training data, as well as a method for producing diverse outputs given the same input image. These methods enable many new applications, such as turning user sketches into photos, season transfer, object transfiguration, photo style transfer, and generating real photographs from painting and computer graphics renderings.
机译:人类是视觉内容的狂热消费者。人们每天都在社交媒体上观看视频,玩数字游戏并分享照片。但是,这是不对称的-尽管每个人都可以使用视觉数据,但只有少数人有才能有效地以视觉方式表达自己。对于我们其他人来说,大多数创建或操纵真实视觉内容的尝试最终都会很快“落空”自然图像的多样性,因此,本文研究了许多在创建和操纵时保持视觉真实性的数据驱动方法。我们将这些方法用作创建视觉内容的训练轮,首先建议直接从大型自然图像中建模视觉逼真度,然后定义一类图像合成和操纵操作,根据它们的输出约束其输出看起来逼真。提出的方法不仅可以帮助用户轻松合成更具视觉吸引力的照片,而且还可以实现这项工作之前无法实现的新视觉效果。第一部分介绍了用于建模视觉真实感和照片美学的判别方法,直接训练这些模型需要昂贵的人工判断为了解决这个问题,我们采用主动和无监督的学习方法来减少注释sts。然后,我们将学习到的模型应用于各种图形任务,例如自动生成图像合成并从相册中选择最漂亮的肖像。第二部分介绍了通过生成模型直接对自然图像流形进行建模并限制图像输出的方法。照片编辑工具位于此流形上。我们基于较简单的图像平均模型和较新的深度模型构建实时数据驱动的探索和编辑界面。第三部分将判别式学习和生成模型结合到端到端的图像到图像转换框架中,训练网络将输入(例如用户草图)直接映射到自然外观的结果。我们提出了一种新算法,可以在没有配对训练数据的情况下学习翻译,以及一种在给定相同输入图像的情况下产生多种输出的方法。这些方法启用了许多新应用程序,例如将用户草图转换为照片,季节转换,对象变形,照片样式转换以及从绘画和计算机图形渲染生成真实照片。

著录项

  • 作者

    Zhu, Junyan.;

  • 作者单位

    University of California, Berkeley.;

  • 授予单位 University of California, Berkeley.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 181 p.
  • 总页数 181
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号