首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >The RGFF representational model: a system for the automatically learned partitioning of 'visual patterns' in digital images
【24h】

The RGFF representational model: a system for the automatically learned partitioning of 'visual patterns' in digital images

机译:RGFF表示模型:一种用于自动学习的数字图像“视觉模式”划分的系统

获取原文
获取原文并翻译 | 示例
       

摘要

This paper describes a system for the automatically learned partitioning of visual patterns in 2D images, based on sophisticated band-pass filtering with fixed scale and orientation sensitivity. The visual patterns are defined as the features which have the highest degree of alignment in the statistical structure across different frequency bands. The analysis reorganizes the image according to an invariance constraint in statistical structure and consists of three stages: pre-attentive stage, integration stage, and learning stage. The first stage takes the input image and performs filtering with log-Gabor filters. Based on their responses, activated filters which are selectively sensitive to patterns in the image are short listed. In the integration stage, common grounds between several activated sensors are explored. The filtered responses are analyzed through a family of statistics. For any given two activated filters, a distance between them is derived via distances between their statistics. The third stage performs cluster partitioning for learning the subspace of log-Gabor filters needed to partition the image data. The clustering is based on a dissimilarity measure intended to highlight scale and orientation invariance of the responses. The technique is illustrated on real and simulated data sets. Finally, this paper presents a computational visual distinctness measure computed from the image representational model based on visual patterns. Experiments are performed to investigate its relation to distinctness as measured by human observers.
机译:本文介绍了一种基于具有固定比例和方向敏感性的复杂带通滤波器,可以自动学习对2D图像中的视觉图案进行分区的系统。视觉模式定义为在不同频段的统计结构中具有最高对齐度的特征。该分析根据统计结构中的不变性约束重新组织图像,并包括三个阶段:注意前阶段,整合阶段和学习阶段。第一阶段获取输入图像,并使用log-Gabor滤波器执行滤波。根据它们的响应,列出了对图像中的图案有选择地敏感的已激活滤镜。在集成阶段,将探索几个激活的传感器之间的共同点。通过一系列统计数据分析过滤后的响应。对于任何给定的两个激活的过滤器,它们之间的距离是通过它们的统计量之间的距离得出的。第三阶段执行集群分区,以学习对图像数据进行分区所需的log-Gabor滤波器的子空间。聚类基于旨在强调响应的尺度和方向不变性的相异性度量。在真实和模拟数据集上说明了该技术。最后,本文提出了一种基于视觉模式的图像表示模型计算出的视觉清晰度计算量度。进行实验以研究其与人类观察者测得的清晰度之间的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号