Inter-modality Image Synthesis and Recognition.

机译：跨模态图像合成与识别。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Inter-modality image synthesis and recognition has been a hot topic in computer vision. In real-world applications, there are diverse image modalities, such as sketch images for law enforcement and near infrared images for illumination invariant face recognition. Therefore, it is often useful to transform images from a modality to another or match images from different modalities, due to the difficulty of acquiring image data in some modality. These techniques provide large flexibility for computer vision applications.;In this thesis we study three problems: face sketch synthesis, example-based image stylization, and face sketch recognition.;For face sketch synthesis, we expand the frontier to synthesis from uncontrolled face photos. Previous methods only work under well controlled conditions. We propose a robust algorithm for synthesizing a face sketch from a face photo with lighting and pose variations. It synthesizes local sketch patches using a multiscale Markov Random Field (MRF) model. The robustness to lighting and pose variations is achieved with three components: shape priors specific to facial components to reduce artifacts and distortions, patch descriptors and robust metrics for selecting sketch patch candidates, and intensity compatibility and gradient compatibility to match neighboring sketch patches effectively. Experiments on the CUHK face sketch database and celebrity photos collected from the web show that our algorithm significantly improves the performance of the state-of-the-art.;For example-based image stylization, we provide an effective approach of transferring artistic effects from a template image to photos. Most existing methods do not consider the content and style separately. We propose a style transfer algorithm via frequency band decomposition. An image is decomposed into the low-frequency (LF), mid-frequency (MF), and high-frequency (HF) components, which describe the content, main style, and information along the boundaries. Then the style is transferred from the template to the photo in the MF and HF components, which is formulated as MRF optimization. Finally a reconstruction step combines the LF component of the photo and the obtained style information to generate the artistic result. Compared to the other algorithms, our method not only synthesizes the style, but also preserves the image content well. We demonstrate that our approach performs excellently in image stylization and personalized artwork in experiments.;For face sketch recognition, we propose a new direction based on learning face descriptors from data. Recent research has focused on transforming photos and sketches into the same modality for matching or developing advanced classification algorithms to reduce the modality gap between features extracted from photos and sketches. We propose a novel approach by reducing the modality gap at the feature extraction stage. A face descriptor based on coupled information-theoretic encoding is used to capture discriminative local face structures and to effectively match photos and sketches. Guided by maximizing the mutual information between photos and sketches in the quantized feature spaces, the coupled encoding is achieved by the proposed coupled information-theoretic projection forest. Experiments on the largest face sketch database show that our approach significantly outperforms the state-of-the-art methods.

机译：模态图像合成和识别已成为计算机视觉中的热门话题。在实际应用中，存在多种图像模式，例如用于执法的草图图像和用于照明不变的面部识别的近红外图像。因此，由于难以以某种方式获取图像数据，因此将图像从一种模态转换为另一种或匹配来自不同模态的图像通常是有用的。这些技术为计算机视觉应用提供了很大的灵活性。；本文研究了三个问题：人脸素描合成，基于实例的图像样式化和人脸素描识别。；对于人脸素描合成，我们将边界扩展到不受控制的人脸照片合成。先前的方法仅在受控条件下有效。我们提出了一种鲁棒的算法，用于根据具有光照和姿势变化的面部照片合成面部素描。它使用多尺度马尔可夫随机场（MRF）模型来合成局部草图补丁。照明和姿势变化的鲁棒性通过三个组件实现：特定于面部组件的形状先验以减少伪影和失真；补丁描述符和用于选择草图补丁候选的鲁棒度量；以及强度兼容性和渐变兼容性，以有效地匹配相邻草图补丁。在香港中文大学的面部素描数据库上进行的实验和从网络上收集的名人照片表明，我们的算法显着改善了最新技术的性能;例如基于图像的风格化，我们提供了一种有效的方法来从照片的模板图像。大多数现有方法不会单独考虑内容和样式。我们提出了一种通过频带分解的风格转移算法。图像被分解为低频（LF），中频（MF）和高频（HF）分量，它们沿边界描述了内容，主要样式和信息。然后将样式从模板转移到MF和HF组件中的照片，这被公式化为MRF优化。最后，重建步骤将照片的LF分量与所获得的样式信息结合起来以生成艺术效果。与其他算法相比，我们的方法不仅可以合成样式，而且可以很好地保留图像内容。我们证明了我们的方法在图像风格化和个性化艺术品实验中表现出色。;对于人脸素描识别，我们基于从数据中学习人脸描述符提出了一个新的方向。最近的研究集中在将照片和草图转换为相同的模态，以匹配或开发高级分类算法，以缩小从照片和草图中提取的特征之间的模态差距。我们通过减少特征提取阶段的模态差距提出了一种新颖的方法。基于耦合信息理论编码的面部描述符用于捕获具有区别性的局部面部结构，并有效地匹配照片和草图。通过最大化量化特征空间中照片和草图之间的相互信息，可以通过提出的耦合信息理论投影林实现耦合编码。在最大的人脸草图数据库上进行的实验表明，我们的方法明显优于最新方法。

著录项

作者
Zhang, Wei.;
展开▼
作者单位

The Chinese University of Hong Kong (Hong Kong).;

展开▼
授予单位 The Chinese University of Hong Kong (Hong Kong).;
学科 Computer science.
学位 Ph.D.
年度 2012
页码 162 p.
总页数 162
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. 3D right ventricular endocardium segmentation in cardiac magnetic resonance images by using a new inter-modality statistical shape modelling method [J] . Piazzese Concetta, Carminati M. Chiara, Krause Rolf, Biomedical signal processing and control . 2020,第Apra期

机译：新的模态统计形状建模方法在心脏磁共振图像中进行3D右心室心内膜分割
2. Imaging of Herniated Discs of the Cervical Spine: Inter-Modality Differences between 64-Slice Multidetector CT and 1.5-T MRI [J] . Yi Ji Sook, Cha Jang Gyu, Han Jong Kyu, Korean journal of radiology : . 2015,第4期

机译：颈椎椎间盘突出的影像学：64层多层探测器CT和1.5-T MRI之间的模态差异
3. Pixel-based classification of high-resolution satellite images for crop-shelter coverage recognition. [J] . Arcidiacono C., Porto S. M. C. Acta Horticulturae . 2012,第937aPta2期

机译：基于像素的高分辨率卫星图像分类，用于农作物庇护所覆盖识别。
4. Non-isomorphic Inter-modality Graph Alignment and Synthesis for Holistic Brain Mapping [C] . Islem Mhiri, Ahmed Nebli, Mohamed Ali Mahjoub, International conference on information processing in medical imaging . 2021

机译：非同构型际曲线图对齐和合成整体脑映射
5. Interpolation of Low Resolution Images for Improved Accuracy in Human Face Recognition. [D] . Elazhari, Abbas. 2014

机译：低分辨率图像的插值可提高人脸识别的准确性。
6. Imaging of Herniated Discs of the Cervical Spine: Inter-Modality Differences between 64-Slice Multidetector CT and 1.5-T MRI [O] . Ji Sook Yi, Jang Gyu Cha, Jong Kyu Han, 2015

机译：颈椎椎间盘突出的影像学：64层多层探测器CT和1.5-T MRI之间的模态差异
7. Inter-modality image synthesis and recognition. [O] . 2012

机译：模态间图像合成和识别。

Inter-modality Image Synthesis and Recognition.

摘要

著录项

相似文献

相关主题

期刊订阅