
Interactive Scene Generation via Scene Graphs with Attributes



Abstract

We introduce a simple yet expressive image generation method. On the one hand, it does not require the user to paint masks or define bounding boxes for the various objects, since the model does this by itself. On the other hand, it supports defining a coarse location and size for each object. Based on this, we offer a simple, interactive GUI that allows a lay user to generate diverse images effortlessly. From a technical perspective, we introduce a dual embedding of layout and appearance. In this scheme, the location, size, and appearance of an object can change independently of each other. This way, the model is able to generate innumerable images per scene graph, to better express the intention of the user. In comparison to previous work, we also offer better quality and higher resolution outputs. This is due to a superior architecture, which is based on a novel set of discriminators. These discriminators better constrain the shape of the generated mask and capture the appearance encoding in a counterfactual way. Our code is publicly available at https://www.github.com/ashual/scene_generation.
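To make the idea of independently editable attributes concrete, the following is a minimal Python sketch of a scene graph whose objects carry a coarse location, a coarse size, and a separate appearance code. It is an illustration only and is not taken from the authors' repository: the class names, the `edit_object` helper, the grid coordinates, the size buckets, and the two-number appearance tuples are all hypothetical stand-ins, and no image generation is performed.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class SceneObject:
    category: str                  # object class, e.g. "tree"
    location: Tuple[int, int]      # coarse (row, col) cell; the grid is an assumption
    size: int                      # coarse size bucket; the scale is an assumption
    appearance: Tuple[float, ...]  # stand-in for an appearance code (assumption)


@dataclass(frozen=True)
class Relation:
    subject: int    # index into SceneGraph.objects
    predicate: str  # e.g. "above"
    target: int     # index into SceneGraph.objects


@dataclass(frozen=True)
class SceneGraph:
    objects: Tuple[SceneObject, ...]
    relations: Tuple[Relation, ...] = ()


def edit_object(graph: SceneGraph, index: int, **changes) -> SceneGraph:
    """Return a new graph in which only the named attributes of one object change."""
    updated = replace(graph.objects[index], **changes)
    objects = graph.objects[:index] + (updated,) + graph.objects[index + 1:]
    return replace(graph, objects=objects)


# A two-object scene: a sky above a tree (all values are made up).
graph = SceneGraph(
    objects=(
        SceneObject("sky", (0, 2), 8, (0.1, 0.9)),
        SceneObject("tree", (3, 1), 3, (0.4, 0.2)),
    ),
    relations=(Relation(0, "above", 1),),
)

# Change only the appearance of the tree; its layout is untouched.
restyled = edit_object(graph, 1, appearance=(0.7, 0.5))

# Change only the layout of the tree; its appearance is untouched.
moved = edit_object(graph, 1, location=(3, 4), size=5)
```

Keeping layout fields and the appearance code as separate attributes means one graph can yield many variants by editing one attribute at a time, which is the property the abstract describes. How the actual model embeds these attributes is not specified here.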
