首页> 外文OA文献 >Multi-resolution image processing and learning for texture recognition and image enhancement
【2h】

Multi-resolution image processing and learning for texture recognition and image enhancement

机译:用于纹理识别和图像增强的多分辨率图像处理和学习

摘要

A general recognition framework is presented that consists of multi-resolution pyramidal feature-extraction and learning paradigms for classification. The system is presented in the context of the texture recognition task.In the feature extraction part of the system, an oriented Laplacian pyramid is used as an efficient filtering scheme to transform the input image to a more robust representation in the frequency and orientation space. An optimal technique is presented for computing a steerable representation of the pyramid. Steerability is used to generate a rotation-invariant input representation.In the learning stage of the system we focus on a rule-based probabilistic learning scheme. This information-theoretic technique is utilized to find the most informative correlations between the attributes and the output classes while producing probability estimates for the outputs. Both unsupervised and supervised learning are utilized. Apart from the rule-based approach we experiment with other non-parametric classifiers, such as the k-nearest neighbor classifier and the Backprop neural-network.We demonstrate experimentally that our scheme improves significantly upon the state-of-the-art both in rotation-invariant classification and in orientation estimation. A variety of applications are presented, including autonomous navigation scenarios and remote-sensing, as possible extensions for the texture recognition system. A generalization of the system to face-recognition is discussed.In the latter part of the thesis, a procedure for creating images with higher resolution than the sampling rate would allow is described. The enhancement algorithm augments the frequency content of the image by using a non-linearity that generates phase-coherent higher harmonics. The procedure utilizes the Laplacian pyramid image representation. Results are presented depicting the power-spectra augmentation and the visual enhancement of several images. Simplicity of computations and ease of implementation allow for real-time applications such as high-definition television (HDTV). An initial investigation is pursued to combine the enhancement scheme with pyramid coding schemes.
机译:提出了一个通用的识别框架,该框架由多分辨率金字塔特征提取和用于分类的学习范例组成。该系统是在纹理识别任务的背景下提出的。在系统的特征提取部分,定向拉普拉斯金字塔被用作有效的滤波方案,以将输入图像变换为频率和方向空间中更鲁棒的表示。提出了一种用于计算金字塔的可操纵表示的最佳技术。可操纵性用于生成旋转不变的输入表示形式。在系统的学习阶段,我们专注于基于规则的概率学习方案。利用这种信息理论技术,可以在属性和输出类别之间找到信息量最大的相关性,同时为输出生成概率估计。无监督和有监督的学习都被利用。除了基于规则的方法外,我们还尝试了其他非参数分类器,例如k最近邻分类器和Backprop神经网络。我们通过实验证明了我们的方案在以下两个方面都进行了重大改进:旋转不变分类和方向估计。提出了各种应用,包括自主导航场景和遥感,作为纹理识别系统的可能扩展。讨论了该系统对人脸识别的一般化。在论文的后半部分,描述了一种以比采样率所允许的分辨率更高的分辨率创建图像的过程。增强算法通过使用产生相位相干高次谐波的非线性来增强图像的频率内容。该过程利用拉普拉斯金字塔图像表示。呈现的结果描述了功率谱的增强和几幅图像的视觉增强。计算的简单性和易于实现的特性允许实时应用,例如高清电视(HDTV)。进行初步研究以将增强方案与金字塔编码方案相结合。

著录项

  • 作者

    Greenspan Hayit;

  • 作者单位
  • 年度 1994
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号