Learning to Synthesize and Manipulate Natural Images

机译：学习合成和处理自然图像

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Humans are avid consumers of visual content. Every day, people watch videos, play digital games and share photos on social media. However, there is an asymmetry---while everybody is able to consume visual data, only a chosen few are talented enough to effectively express themselves visually. For the rest of us, most attempts at creating or manipulating realistic visual content end up quickly "falling off'' the manifold of natural images. In this thesis, we investigate a number of data-driven approaches for preserving visual realism while creating and manipulating photographs. We use these methods as training wheels for visual content creation. We first propose to model visual realism directly from large-scale natural images. We then define a class of image synthesis and manipulation operations, constraining their outputs to look realistic according to the learned models. The presented methods not only help users easily synthesize more visually appealing photos but also enable new visual effects not possible before this work.;Part I describes discriminative methods for modeling visual realism and photograph aesthetics. Directly training these models requires expensive human judgments. To address this, we adopt active and unsupervised learning methods to reduce annotation costs. We then apply the learned model to various graphics tasks, such as automatically generating image composites and choosing the best-looking portraits from a photo album.;Part II presents approaches that directly model the natural image manifold via generative models and constrain the output of a photo editing tool to lie on this manifold. We build real-time data-driven exploration and editing interfaces based on both simpler image averaging models and more recent deep models.;Part III combines the discriminative learning and generative modeling into an end-to-end image-to-image translation framework, where a network is trained to map inputs (such as user sketches) directly to natural looking results. We present a new algorithm that can learn the translation in the absence of paired training data, as well as a method for producing diverse outputs given the same input image. These methods enable many new applications, such as turning user sketches into photos, season transfer, object transfiguration, photo style transfer, and generating real photographs from painting and computer graphics renderings.

机译：人类是视觉内容的狂热消费者。人们每天都在社交媒体上观看视频，玩数字游戏并分享照片。但是，这是不对称的-尽管每个人都可以使用视觉数据，但只有少数人有才能有效地以视觉方式表达自己。对于我们其他人来说，大多数创建或操纵真实视觉内容的尝试最终都会很快“落空”自然图像的多样性，因此，本文研究了许多在创建和操纵时保持视觉真实性的数据驱动方法。我们将这些方法用作创建视觉内容的训练轮，首先建议直接从大型自然图像中建模视觉逼真度，然后定义一类图像合成和操纵操作，根据它们的输出约束其输出看起来逼真。提出的方法不仅可以帮助用户轻松合成更具视觉吸引力的照片，而且还可以实现这项工作之前无法实现的新视觉效果。第一部分介绍了用于建模视觉真实感和照片美学的判别方法，直接训练这些模型需要昂贵的人工判断为了解决这个问题，我们采用主动和无监督的学习方法来减少注释sts。然后，我们将学习到的模型应用于各种图形任务，例如自动生成图像合成并从相册中选择最漂亮的肖像。第二部分介绍了通过生成模型直接对自然图像流形进行建模并限制图像输出的方法。照片编辑工具位于此流形上。我们基于较简单的图像平均模型和较新的深度模型构建实时数据驱动的探索和编辑界面。第三部分将判别式学习和生成模型结合到端到端的图像到图像转换框架中，训练网络将输入（例如用户草图）直接映射到自然外观的结果。我们提出了一种新算法，可以在没有配对训练数据的情况下学习翻译，以及一种在给定相同输入图像的情况下产生多种输出的方法。这些方法启用了许多新应用程序，例如将用户草图转换为照片，季节转换，对象变形，照片样式转换以及从绘画和计算机图形渲染生成真实照片。

著录项

作者
Zhu, Junyan.;
展开▼
作者单位

University of California, Berkeley.;

展开▼
授予单位 University of California, Berkeley.;
学科 Computer science.
学位 Ph.D.
年度 2017
页码 181 p.
总页数 181
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Learning to Synthesize and Manipulate Natural Images [J] . Zhu Jun-Yan IEEE Computer Graphics and Applications . 2019,第2期

机译：学习合成和处理自然图像
2. Learning to Synthesize and Manipulate Natural Images [J] . Zhu Jun-Yan IEEE Computer Graphics and Applications . 2019,第2期

机译：学习综合和操纵自然图像
3. Supervised Learning Occurs in Visual Perceptual Learning of Complex Natural Images [J] . Frank Sebastian M., Qi Andrea, Ravasio Daniela, Current Biology: CB . 2020,第15期

机译：监督学习发生在复杂自然图像的视觉感知学习中
4. Synthesizing human-like sketches from natural images using a conditional convolutional decoder [C] . Moritz Kampelmühler, Axel Pinz IEEE Winter Conference on Applications of Computer Vision . 2020

机译：使用条件卷积解码器从自然图像合成人像草图
5. Reconstructing and Optimizing Natural Images Perceived by the Human Brain Based on Bayesian Deep Multi-View Learning [D] . Li, Xintong. 2021

机译：基于贝叶斯深度多视图学习的人脑重建和优化自然图像
6. Deep learning approach to classification of lung cytological images: Two-step training using actual and synthesized images by progressive growing of generative adversarial networks [O] . Atsushi Teramoto, Tetsuya Tsukamoto, Ayumi Yamada, 2020

机译：深层学习方法肺细胞学图像分类：使用实际和合成图像的两步训练通过逐步生长的生长生长育种网络
7. Learning to Synthesize and Manipulate Natural Images [O] . Jun-Yan Zhu, Jim Foley 2019

机译：学习综合和操纵自然图像

Learning to Synthesize and Manipulate Natural Images

摘要

著录项

相似文献

相关主题

期刊订阅