...
【24h】

Generic interactive pixel-level image editing

机译:通用交互式像素级图像编辑

获取原文
获取原文并翻译 | 示例
           

摘要

Several image editing methods have been proposed in the past decades, achieving brilliant results. The most sophisticated of them, however, require additional information per-pixel. For instance, dehazing requires a specific transmittance value per pixel, or depth of field blurring requires depth or disparity values per pixel. This additional per-pixel value is obtained either through elaborated heuristics or through additional control over the capture hardware, which is very often tailored for the specific editing application. In contrast, however, we propose a generic editing paradigm that can become the base of several different applications. This paradigm generates both the needed per-pixel values and the resulting edit at interactive rates, with minimal user input that can be iteratively refined. Our key insight for getting per-pixel values at such speed is to cluster them into superpixels, but, instead of a constant value per superpixel (which yields accuracy problems), we have a mathematical expression for pixel values at each superpixel: in our case, an order two multinomial per superpixel. This leads to a linear least-squares system, effectively enabling specific per-pixel values at fast speeds. We illustrate this approach in three applications: depth of field blurring (from depth values), dehazing (from transmittance values) and tone mapping (from brightness and contrast local values), and our approach proves both favorably interactive and accurate in all three. Our technique is also evaluated with a common dataset and compared favorably.
机译:在过去的几十年里提出了几种图像编辑方法,实现了辉煌的结果。但是,它们最复杂的是每个像素的额外信息。例如,去吸附需要每个像素的特定透射率值,或者场模糊的景深需要每个像素的深度或视差值。通过详细的启发式或通过对捕获硬件的额外控制来获得此额外的每个像素值,这通常是针对特定编辑应用程序量身定制的。然而,相比之下,我们提出了一种通用的编辑范例,可以成为几种不同应用的基础。此范例以交互式速率生成所需的每个像素值和结果编辑,具有最小的用户输入,可以迭代地精制。我们以这种速度获取每个像素值的关键洞察力是将它们聚集到超像素中,但是,而不是每个超像素的恒定值(产生精度问题),我们在每个Superpixel处具有像素值的数学表达式:在我们的情况下,每个超像素的两个多项式。这导致线性最小二乘系统,以快速速度有效地实现特定的每个像素值。我们在三种应用中说明了这种方法:场景模糊(从深度值),脱落(来自透射值)和音调映射(从亮度和对比度局部值),以及我们的方法在所有三个中证明了有利的互动和准确。我们的技术也用常见的数据集进行评估,并比较有利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号