首页> 外文学位 >Models of bottom-up and top-down visual attention.
【24h】

Models of bottom-up and top-down visual attention.

机译:自下而上和自上而下的视觉注意力模型。

获取原文
获取原文并翻译 | 示例

摘要

When we observe our visual environment, we do not perceive all its components as being equally interesting. Some objects automatically and effortlessly "pop-out" from their surroundings, that is, they draw our visual attention, in a " bottom-up" manner, towards them. In a first approximation, focal visual attention acts as a rapidly shiftable "spotlight," which allows only the selected information to reach higher levels of processing and representation. Most models of the bottom-up control of attention are based on the concept of a saliency map, that is, an explicit two-dimensional map that encodes the conspicuity of objects in the visual environment. Competition among neurons in this map gives rise to a single winning location that corresponds to the next attended target. Inhibiting this location automatically allows the system to attend to the next most salient location. A first body of work in this thesis describes a detailed computer implementation of such a scheme, focusing on the problem of combining information across modalities, here orientation, intensity and color information, in a purely stimulus-driven manner. The model is applied to common psychophysical stimuli as well as to very demanding visual search tasks. Its successful performance is used to address the extent to which the primate visual system carries out visual search via one or more such saliency maps and how this can be tested.; We next address the question of what happens once our attention is focused onto a restricted part of our visual field. There is mounting experimental evidence that attention is far more sophisticated than a simple feed-forward spatially-selective filtering process. Indeed, visual processing appears to be significantly different inside the attentional spotlight than outside. That is, in addition to its properties as a feed-forward information processing and transmission bottleneck, focal visual attention feeds back and locally modulates, in a "top-down" manner, the visual processing and representation of selected objects. The second body of work presented in this thesis is concerned with a detailed computational model of basic pattern vision in humans and its modulation by top-down attention. We start by acquiring a complete dataset of five different simple psychophysical experiments, including discriminations of contrast, orientation and spatial frequency of simple pattern stimuli by human observers. This experimental dataset places strict constraints on our model of early pattern vision. The model, however, is eventually able to reproduce the entire dataset while assuming plausible neurobiological components. The model is further applied to existing psychophysical data which demonstrates how top-down attention alters performance in these simple psychophysical discrimination experiments. Our model is able to quantitatively account for all observations by assuming that attention strengthens the non-linear cortical interactions among visual neurons.
机译:当我们观察视觉环境时,我们并不认为它的所有组件都同样有趣。一些对象会自动毫不费力地从周围环境中“弹出”,也就是说,它们以“自下而上”的方式吸引我们的目光。在第一近似中,焦点视觉注意力充当可快速移动的“聚光灯”,这仅允许所选信息达到更高的处理和表示水平。自下而上的注意力控制的大多数模型都是基于显着性图的概念,即显式的二维图,它编码视觉环境中对象的显眼性。在此地图中,神经元之间的竞争产生了一个与下一个参与目标相对应的获胜位置。自动禁止此位置可使系统进入下一个最显着的位置。本论文的第一部分工作描述了这种方案的详细的计算机实现,重点在于以纯粹的激励驱动方式将跨模态的信息(这里是方向,强度和颜色信息)组合在一起的问题。该模型适用于常见的心理物理刺激以及非常苛刻的视觉搜索任务。它的成功表现用于解决灵长类动物视觉系统通过一个或多个此类显着性图进行视觉搜索的程度以及如何对其进行测试。接下来,我们要解决的问题是,一旦我们将注意力集中在视野的有限部分上,将会发生什么。越来越多的实验证据表明,注意力要比简单的前馈空间选择性过滤过程复杂得多。的确,注意聚光灯内部的视觉处理与外部的视觉处理明显不同。即,除了其作为前馈信息处理和传输瓶颈的特性之外,焦点视觉注意力还以“自上而下”的方式反馈并局部调制所选对象的视觉处理和表示。本文提出的第二项工作涉及人类基本模式视觉的详细计算模型及其自上而下的注意力调节。我们首先获得五个不同的简单心理物理实验的完整数据集,包括人类观察者对对比,方向和简单模式刺激的空间频率的区分。该实验数据集对我们的早期模式视觉模型施加了严格的约束。然而,该模型最终能够在假设合理的神经生物学成分的同时复制整个数据集。该模型进一步应用于现有的心理物理数据,该数据演示了自上而下的注意力如何改变这些简单的心理物理歧视实验的性能。我们的模型能够通过假设注意力增强视觉神经元之间的非线性皮层相互作用来量化所有观察结果。

著录项

  • 作者

    Itti, Laurent.;

  • 作者单位

    California Institute of Technology.;

  • 授予单位 California Institute of Technology.;
  • 学科 Biology Neuroscience.; Psychology Experimental.; Computer Science.
  • 学位 Ph.D.
  • 年度 2000
  • 页码 202 p.
  • 总页数 202
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 神经科学;心理学;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号