首页> 外文学位 >Modeling the control of attention by visual and semantic factors in real-world scenes.
【24h】

Modeling the control of attention by visual and semantic factors in real-world scenes.

机译:通过现实世界中的视觉和语义因素对注意力控制进行建模。

获取原文
获取原文并翻译 | 示例

摘要

Recently, there has been great interest among vision researchers in developing computational models that predict the distribution of saccadic eye movements in various visual tasks. The analysis of attentional guidance by low-level visual features in real- world scenes is difficult because the strength and distribution of visual saliency cannot be controlled. Moreover, real-world scenes are composed of objects with not only low-level visual features such as shape and color, but also high-level features such as meaning and semantic relations among them. While it has been found that low-level visual features guide eye movements during visual tasks in both stimulus-driven (bottom-up) and goal- driven (top-down) ways, the influence of high-level features such as meaning and semantic relations among scene objects on eye movements in such situations is still unknown.;In this manuscript, I propose a top-down model of visual attention which is based on visual similarity between the target and regions of the search scene using a histogram- matching technique, where similarity is defined for several feature dimensions such as color, orientation, or spatial frequency. The amount of attentional guidance across visual feature dimensions is predicted by a measure of informativeness of each feature dimension. Then, using a novel eye-movement analysis on visual saliency maps, I will show the dynamics between top-down and bottom-up guidance, in which top-down control is initially weak but quickly dominates search while narrowing its focus, whereas bottom-up control is slowly diminishing while using a constantly large visual span. Moreover, in order to build a foundation for a comprehensive, general computational model that also accounts for higher-level attentional guidance, I will show the existence of semantic guidance effects on eye movements, that is, subjects' tendencies of consecutively fixating semantically similar objects during scene inspection and fixating objects that are semantically similar to the search target during scene search. Semantic similarity of objects will be computed by applying Latent Semantic Analysis (LSA) to object descriptions in real-world scenes.;The results of my doctoral thesis will broaden our understanding of attentional guidance in real-world scenes under various visual tasks and should eventually lead to a general computational model of attentional guidance.
机译:最近,视觉研究人员对开发预测各种视觉任务中眼跳运动分布的计算模型产生了极大的兴趣。由于无法控制视觉显着性的强度和分布,因此很难通过现实场景中的低级视觉特征来分析注意引导。而且,真实世界的场景不仅由对象构成,这些对象不仅具有形状和颜色等低级视觉特征,而且还具有它们之间的意义和语义关系等高级特征。虽然已经发现,低级视觉功能以刺激驱动(自下而上)和目标驱动(自上而下)的方式指导视觉任务期间的眼球运动,但高级功能(如含义和语义)的影响在这种情况下,场景对象之间的眼睛运动之间的关系仍然未知。;在本文中,我提出了一种自上而下的视觉注意模型,该模型基于直方图匹配技术,基于目标和搜索场景区域之间的视觉相似性,其中为多个特征尺寸(例如颜色,方向或空间频率)定义了相似性。视觉特征维度上的注意引导量是通过衡量每个特征维度的信息量来预测的。然后,通过对视觉显着性地图进行新颖的眼动分析,我将展示自上而下和自下而上的指导之间的动态关系,其中自上而下的控制起初较弱,但在缩小搜索范围的同时迅速主导了搜索,而自下而上的使用持续较大的视觉范围时,向上控制会逐渐减弱。此外,为了建立一个综合的,通用的计算模型的基础,该模型也考虑了较高级别的注意指导,我将说明语义指导对眼睛运动的影响,即受试者连续固定语义相似对象的趋势。在场景检查过程中,并修复在语义上类似于场景搜索过程中搜索目标的对象。通过将潜在语义分析(LSA)应用于真实场景中的对象描述,可以计算出对象的语义相似性;我的博士学位论文的结果将拓宽我们对各种视觉任务下真实场景中注意指导的理解,并且最终应该导致注意指导的一般计算模型。

著录项

  • 作者

    Hwang, Alex Daejoon.;

  • 作者单位

    University of Massachusetts Boston.;

  • 授予单位 University of Massachusetts Boston.;
  • 学科 Engineering Biomedical.;Psychology Cognitive.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 156 p.
  • 总页数 156
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:37:05

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号