首页> 外文学位 >Ideal observer analysis of object recognition.
【24h】

Ideal observer analysis of object recognition.

机译:理想的观察者分析对象识别。

获取原文
获取原文并翻译 | 示例

摘要

General-purpose object recognition is an important function of the human visual system. We applied the quantitative techniques that have been successful for studying low-level sensory processing to the study of visual object recognition. We emphasized that the performance of a visual system depends on two factors: (1) how the system processes visual information, and (2) how much task-relevant information there is to be processed. By constructing an ideal observer for visual object recognition, we proposed four quantitative measurements to characterize these factors. Statistical efficiency measures the accuracy of a visual system relative to the informational constraints inherent in a task. View-processing rate gauges a visual system's processing speed with respect to a task's inherent requirements for representing the details of objects. These two measurements allow direct comparison of a visual system's processing capability across different visual recognition tasks. A third measure, stimulus informativeness determines the maximally achievable level of accuracy for a given task across all visual systems. Finally, view complexity quantifies a task's representational requirements, which are imposed on any observer. We applied these measurement techniques to resolve issues in object recognition research. Specifically, we (1) determined the factors that limit human processing capability for object recognition, (2) measured the limits on letter recognition for different fonts when they were presented in noise, spatial uncertainty and blur, and (3) measured the equivalent number of 2-D views required to represent different 3-D object ensembles to maximally facilitate recognition. We also proposed a framework, based on task partitioning, for the analysis of the general-purpose visual object recognition problem. This framework defines an object recognition task in terms of its input and context. In addition, it expresses an implementation of an object recognition task in terms of a network of other recognition tasks. The information sources for the input and context of each task in an implementation are made explicit by this framework.
机译:通用目标识别是人类视觉系统的重要功能。我们将已经成功用于研究低水平感官加工的定量技术应用于视觉对象识别的研究。我们强调视觉系统的性能取决于两个因素:(1)系统如何处理视觉信息,(2)要处理多少与任务相关的信息。通过构建理想的视觉对象识别观察者,我们提出了四个定量测量来表征这些因素。统计效率衡量视觉系统相对于任务固有信息约束的准确性。视图处理速率根据代表对象详细信息的任务固有要求来衡量视觉系统的处理速度。这两个测量值可以直接比较视觉系统在不同视觉识别任务之间的处理能力。第三种措施,刺激信息性,决定了所有视觉系统中给定任务的最大可达到的准确性水平。最后,视图复杂性量化了任务的表示要求,这些要求被强加给任何观察者。我们应用这些测量技术来解决对象识别研究中的问题。具体来说,我们(1)确定了限制人为识别对象的处理能力的因素;(2)测量了以噪声,空间不确定性和模糊呈现的不同字体对字母识别的限制,以及(3)测量了等效数字代表不同3D对象集合所需的2D视图的最大数量,以最大程度地促进识别。我们还提出了一个基于任务划分的框架,用于分析通用视觉对象识别问题。该框架根据其输入和上下文定义了对象识别任务。另外,它根据其他识别任务的网络表示对象识别任务的实现。此框架明确了实现中每个任务的输入和上下文的信息源。

著录项

  • 作者

    Tjan, Bosco Siautung.;

  • 作者单位

    University of Minnesota.;

  • 授予单位 University of Minnesota.;
  • 学科 Computer Science.;Psychology Cognitive.;Psychology Experimental.
  • 学位 Ph.D.
  • 年度 1997
  • 页码 195 p.
  • 总页数 195
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号