首页> 外文会议>IEEE International Conference on Cognitive Informatics Cognitive Computing >Using the 5th dimensions of human visual perception to inspire automated edge and texture segmentation: A fuzzy spatial-taxon approach
【24h】

Using the 5th dimensions of human visual perception to inspire automated edge and texture segmentation: A fuzzy spatial-taxon approach

机译:使用人类视觉感知的第5个维度来激发自动的边缘和纹理分割:一种模糊的空间分类法

获取原文

摘要

With the recent stunning success of machine learning, artificially intelligent machine vision research falls (roughly) into two camps: the big data camp and cognitive informatics camp. Big data uses statistical methods to discover latent structures that emerge from the co-occurrences of relevant features when sampling over enormous quantities of data. The cognitive informatics methods design computer vision systems to mimic human cognition. Though some visual latent features that emerge from deep learning networks, mimic mammalian visual detectors, as of yet the information processing mechanisms (analogous to human psychophysical mechanisms) remain hidden within the complexity of the deep nets. Furthermore, the sampling requirements of big data systems require limiting samples to pre-processed sets, such as SHIFT (shift invariant feature transform (Lowe 1999)). Techniques, such as the ones introduced in this paper, provide fast cognitively relevant methods for selecting samples and reducing the number of candidate features. The approach described in this paper live squarely in the camp of designing computer vision A.I. to mimics human cognitive processes. I introduce a novel definition of edges, based on human hierarchical scene perception. Hierarchical scene perception views vision within the 5 dimensions of horizontal & vertical position, depth, time and scene abstraction level (spatial-taxon). Fuzzy inference selects candidate edge elements using the Gestalt psychology principal of good curvilinear continuation, proximity and edges attachment. Spatial-taxon inference infers an edge outline for each level of abstraction within the scene architecture. The system was tested on 60 natural images and the results provide edges more aligned with human intuition of what edges should look like. ROC plots indicate solid performance, with the majority of human subjects rating the edge detection as high quality. The inferred edges are consistent with the finding of neurons responsive to proto-object boundaries in the visual cortex.
机译:随着近来机器学习的惊人成功,人工智能机器视觉研究大致分为两个阵营:大数据阵营和认知信息学阵营。大数据使用统计方法来发现在对大量数据进行采样时从相关特征的共现中出现的潜在结构。认知信息学方法设计计算机视觉系统来模仿人类的认知。尽管从深度学习网络中出现了一些潜在的视觉特征,但模仿哺乳动物的视觉检测器至今,信息处理机制(类似于人类的心理物理机制)仍然隐藏在深层网络的复杂性内。此外,大数据系统的采样要求要求将样本限制为预处理集,例如SHIFT(移位不变特征变换(Lowe 1999))。如本文介绍的技术,为选择样本和减少候选特征的数量提供了快速的认知相关方法。本文描述的方法正好存在于设计计算机视觉AI的阵营中。模仿人类的认知过程。我介绍了基于人的分层场景感知的边缘的新颖定义。分层场景感知可在水平和垂直位置,深度,时间和场景抽象级别(空间分类)的5个维度内查看视觉。模糊推理使用完好的曲线连续性,邻近性和边缘附着性的格式塔心理学原则选择候选边缘元素。空间分类单元推断可为场景体系结构中的每个抽象级别推断出边缘轮廓。该系统在60张自然图像上进行了测试,结果提供的边缘与人类对边缘的外观的直觉更加吻合。 ROC曲线表明性能稳定,大多数人类受试者将边缘检测评为高质量。推断的边缘与对视觉皮层中的原始对象边界有反应的神经元的发现相一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号