首页> 外文会议>SPIE Commercial + Scientific Sensing and Imaging Conference >Processing Global and Local Features in Convolutional Neural Network (CNN) and Primate Visual Systems
【24h】

Processing Global and Local Features in Convolutional Neural Network (CNN) and Primate Visual Systems

机译:在卷积神经网络(CNN)和灵长类视觉系统中处理全局和局部特征

获取原文

摘要

In the human visual system, visible objects are recognized by features, which can be classified into local features that are based on their simple components (i.e., line segment, angle, color, etc.) and global features that are based on the whole objects (i.e., connectivity, number of holes, etc.). Over the past half century, anatomical, physiological, behavioral and computational studies of the visual systems have led to a generally accepted model of vision, which starts at processing local features in the early stages of the visual pathways, followed by integrating them to global features in the later stages of the visual pathways. However, this popular local-to-global model has been challenged by a set of experiments showing that the visual systems in humans, non-human primates and honey bees are more sensitive to global features than local features. These 'global-first' studies further motivated developing new paradigms and approaches to understand human vision and build new vision models. In this study, we started a new series of experiments that examine how two representative pre-trained Convolutional Neural Networks (CNN) (AlexNet and VGG-19) process local and global features. The CNNs were trained to classify geometric shapes into two categories based on local features (e.g., triangle, square and circle) or a global feature (e.g., having a hole). In contrast to the biological visual systems, the CNNs were more effective at classifying images based on local features than the global feature. We further showed that adding distractors greatly lowered the performance of the CNNs, again different from the biological visual systems. Ongoing studies will extend these analyses to other geometrical invariants and internal representations of the CNNs. The overarching goal is to use the powerful CNNs as a tool to gain insights into the biological visual systems, including that of humans and non-human primates.
机译:在人类视觉系统中,可见对象被特征识别,可以将其分为基于其简单成分(即线段,角度,颜色等)的局部特征和基于整个对象的全局特征(即连通性,孔数等)。在过去的半个世纪中,视觉系统的解剖学,生理学,行为学和计算研究已导致人们普遍接受的视觉模型,该模型首先在视觉通路的早期阶段处理局部特征,然后将其整合到全局特征中。在视觉通路的后期。但是,此流行的局部到全局模型已受到一组实验的挑战,这些实验表明,人类,非人类灵长类动物和蜜蜂的视觉系统对全局特征比对局部特征更敏感。这些“全球第一”的研究进一步激发了开发新的范式和方法来理解人类视觉并建立新的视觉模型。在这项研究中,我们开始了一系列新的实验,研究了两个代表性的预训练卷积神经网络(CNN)(AlexNet和VGG-19)如何处理局部和全局特征。对CNN进行了训练,可以根据局部特征(例如三角形,正方形和圆形)或整体特征(例如具有孔)将几何形状分为两类。与生物视觉系统相比,CNN在基于局部特征的图像分类上比全局特征更有效。我们进一步表明,添加干扰物会大大降低CNN的性能,这又与生物视觉系统不同。正在进行的研究会将这些分析扩展到CNN的其他几何不变式和内部表示形式。首要目标是使用功能强大的CNN作为一种工具,以深入了解生物视觉系统,包括人类和非人类灵长类动物。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号