首页> 外文学位 >Inter-modality Image Synthesis and Recognition.
【24h】

Inter-modality Image Synthesis and Recognition.

机译:跨模态图像合成与识别。

获取原文
获取原文并翻译 | 示例

摘要

Inter-modality image synthesis and recognition has been a hot topic in computer vision. In real-world applications, there are diverse image modalities, such as sketch images for law enforcement and near infrared images for illumination invariant face recognition. Therefore, it is often useful to transform images from a modality to another or match images from different modalities, due to the difficulty of acquiring image data in some modality. These techniques provide large flexibility for computer vision applications.;In this thesis we study three problems: face sketch synthesis, example-based image stylization, and face sketch recognition.;For face sketch synthesis, we expand the frontier to synthesis from uncontrolled face photos. Previous methods only work under well controlled conditions. We propose a robust algorithm for synthesizing a face sketch from a face photo with lighting and pose variations. It synthesizes local sketch patches using a multiscale Markov Random Field (MRF) model. The robustness to lighting and pose variations is achieved with three components: shape priors specific to facial components to reduce artifacts and distortions, patch descriptors and robust metrics for selecting sketch patch candidates, and intensity compatibility and gradient compatibility to match neighboring sketch patches effectively. Experiments on the CUHK face sketch database and celebrity photos collected from the web show that our algorithm significantly improves the performance of the state-of-the-art.;For example-based image stylization, we provide an effective approach of transferring artistic effects from a template image to photos. Most existing methods do not consider the content and style separately. We propose a style transfer algorithm via frequency band decomposition. An image is decomposed into the low-frequency (LF), mid-frequency (MF), and high-frequency (HF) components, which describe the content, main style, and information along the boundaries. Then the style is transferred from the template to the photo in the MF and HF components, which is formulated as MRF optimization. Finally a reconstruction step combines the LF component of the photo and the obtained style information to generate the artistic result. Compared to the other algorithms, our method not only synthesizes the style, but also preserves the image content well. We demonstrate that our approach performs excellently in image stylization and personalized artwork in experiments.;For face sketch recognition, we propose a new direction based on learning face descriptors from data. Recent research has focused on transforming photos and sketches into the same modality for matching or developing advanced classification algorithms to reduce the modality gap between features extracted from photos and sketches. We propose a novel approach by reducing the modality gap at the feature extraction stage. A face descriptor based on coupled information-theoretic encoding is used to capture discriminative local face structures and to effectively match photos and sketches. Guided by maximizing the mutual information between photos and sketches in the quantized feature spaces, the coupled encoding is achieved by the proposed coupled information-theoretic projection forest. Experiments on the largest face sketch database show that our approach significantly outperforms the state-of-the-art methods.
机译:模态图像合成和识别已成为计算机视觉中的热门话题。在实际应用中,存在多种图像模式,例如用于执法的草图图像和用于照明不变的面部识别的近红外图像。因此,由于难以以某种方式获取图像数据,因此将图像从一种模态转换为另一种或匹配来自不同模态的图像通常是有用的。这些技术为计算机视觉应用提供了很大的灵活性。;本文研究了三个问题:人脸素描合成,基于实例的图像样式化和人脸素描识别。;对于人脸素描合成,我们将边界扩展到不受控制的人脸照片合成。先前的方法仅在受控条件下有效。我们提出了一种鲁棒的算法,用于根据具有光照和姿势变化的面部照片合成面部素描。它使用多尺度马尔可夫随机场(MRF)模型来合成局部草图补丁。照明和姿势变化的鲁棒性通过三个组件实现:特定于面部组件的形状先验以减少伪影和失真;补丁描述符和用于选择草图补丁候选的鲁棒度量;以及强度兼容性和渐变兼容性,以有效地匹配相邻草图补丁。在香港中文大学的面部素描数据库上进行的实验和从网络上收集的名人照片表明,我们的算法显着改善了最新技术的性能;例如基于图像的风格化,我们提供了一种有效的方法来从照片的模板图像。大多数现有方法不会单独考虑内容和样式。我们提出了一种通过频带分解的风格转移算法。图像被分解为低频(LF),中频(MF)和高频(HF)分量,它们沿边界描述了内容,主要样式和信息。然后将样式从模板转移到MF和HF组件中的照片,这被公式化为MRF优化。最后,重建步骤将照片的LF分量与所获得的样式信息结合起来以生成艺术效果。与其他算法相比,我们的方法不仅可以合成样式,而且可以很好地保留图像内容。我们证明了我们的方法在图像风格化和个性化艺术品实验中表现出色。;对于人脸素描识别,我们基于从数据中学习人脸描述符提出了一个新的方向。最近的研究集中在将照片和草图转换为相同的模态,以匹配或开发高级分类算法,以缩小从照片和草图中提取的特征之间的模态差距。我们通过减少特征提取阶段的模态差距提出了一种新颖的方法。基于耦合信息理论编码的面部描述符用于捕获具有区别性的局部面部结构,并有效地匹配照片和草图。通过最大化量化特征空间中照片和草图之间的相互信息,可以通过提出的耦合信息理论投影林实现耦合编码。在最大的人脸草图数据库上进行的实验表明,我们的方法明显优于最新方法。

著录项

  • 作者

    Zhang, Wei.;

  • 作者单位

    The Chinese University of Hong Kong (Hong Kong).;

  • 授予单位 The Chinese University of Hong Kong (Hong Kong).;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 162 p.
  • 总页数 162
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号