Ideal observer analysis of object recognition.

机译：理想的观察者分析对象识别。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

General-purpose object recognition is an important function of the human visual system. We applied the quantitative techniques that have been successful for studying low-level sensory processing to the study of visual object recognition. We emphasized that the performance of a visual system depends on two factors: (1) how the system processes visual information, and (2) how much task-relevant information there is to be processed. By constructing an ideal observer for visual object recognition, we proposed four quantitative measurements to characterize these factors. Statistical efficiency measures the accuracy of a visual system relative to the informational constraints inherent in a task. View-processing rate gauges a visual system's processing speed with respect to a task's inherent requirements for representing the details of objects. These two measurements allow direct comparison of a visual system's processing capability across different visual recognition tasks. A third measure, stimulus informativeness determines the maximally achievable level of accuracy for a given task across all visual systems. Finally, view complexity quantifies a task's representational requirements, which are imposed on any observer. We applied these measurement techniques to resolve issues in object recognition research. Specifically, we (1) determined the factors that limit human processing capability for object recognition, (2) measured the limits on letter recognition for different fonts when they were presented in noise, spatial uncertainty and blur, and (3) measured the equivalent number of 2-D views required to represent different 3-D object ensembles to maximally facilitate recognition. We also proposed a framework, based on task partitioning, for the analysis of the general-purpose visual object recognition problem. This framework defines an object recognition task in terms of its input and context. In addition, it expresses an implementation of an object recognition task in terms of a network of other recognition tasks. The information sources for the input and context of each task in an implementation are made explicit by this framework.

机译：通用目标识别是人类视觉系统的重要功能。我们将已经成功用于研究低水平感官加工的定量技术应用于视觉对象识别的研究。我们强调视觉系统的性能取决于两个因素：（1）系统如何处理视觉信息，（2）要处理多少与任务相关的信息。通过构建理想的视觉对象识别观察者，我们提出了四个定量测量来表征这些因素。统计效率衡量视觉系统相对于任务固有信息约束的准确性。视图处理速率根据代表对象详细信息的任务固有要求来衡量视觉系统的处理速度。这两个测量值可以直接比较视觉系统在不同视觉识别任务之间的处理能力。第三种措施，刺激信息性，决定了所有视觉系统中给定任务的最大可达到的准确性水平。最后，视图复杂性量化了任务的表示要求，这些要求被强加给任何观察者。我们应用这些测量技术来解决对象识别研究中的问题。具体来说，我们（1）确定了限制人为识别对象的处理能力的因素；（2）测量了以噪声，空间不确定性和模糊呈现的不同字体对字母识别的限制，以及（3）测量了等效数字代表不同3D对象集合所需的2D视图的最大数量，以最大程度地促进识别。我们还提出了一个基于任务划分的框架，用于分析通用视觉对象识别问题。该框架根据其输入和上下文定义了对象识别任务。另外，它根据其他识别任务的网络表示对象识别任务的实现。此框架明确了实现中每个任务的输入和上下文的信息源。

著录项

作者
Tjan, Bosco Siautung.;
展开▼
作者单位

University of Minnesota.;

展开▼
授予单位 University of Minnesota.;
学科 Computer Science.;Psychology Cognitive.;Psychology Experimental.
学位 Ph.D.
年度 1997
页码 195 p.
总页数 195
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Multiple object tracking difficulty accounted for by an ideal observer [J] . Cory Rieth, Edward Vul Journal of vision . 2013,第9期

机译：理想观察者解决的多目标跟踪困难
2. Classification objects, ideal observers & generative models [J] . Cheryl Olman, Daniel Kersten Cognitive Science . 2004,第2期

机译：分类对象，理想观察者和生成模型
3. Rapid categorization of faces and objects in a patient with impaired object recognition. [J] . Boucart M, Moroni C, Despretz P, Neurocase: case studies in neuropsychology, neuropsychiatry, and behavioural neurology . 2010,第2期

机译：对对象识别能力受损的患者中的脸部和对象进行快速分类。
4. Adaptive image analysis for object recognition. I. Entropic object location [C] . Vaidyanathan, A.G., Whitcomb, . 1995

机译：用于物体识别的自适应图像分析。一，熵物体的位置
5. Ideal observer estimation and generalized ROC analysis for computer-aided diagnosis. [D] . Edwards, Darrin C. 2003

机译：用于计算机辅助诊断的理想观察者估计和广义ROC分析。
6. Using an ideal observer analysis to investigate the visual perceptual efficiency of individuals with a history of non-suicidal self-injury when identifying emotional expressions [O] . Laura Ziebell, Charles Collin, Stéphane Rainville, 2020

机译：使用理想的观察者分析来调查具有非自杀性自残历史的个人在识别情绪表达时的视觉感知效率
7. Haptic SLAM: An Ideal Observer Model for Bayesian Inference of Object Shape and Hand Pose from Contact Dynamics [O] . Behbahani FMP, Singla–Buxarrais G, Faisal AA 2016

机译：触觉sLam：一种理想的观测器模型，用于贝叶斯推理物体形状和手部姿态的接触动力学
8. Analysis of the Effect of Gaussian Error in Object Recognition. [R] . Sarachik, K. B. 1994

机译：高斯误差在物体识别中的作用分析。

Ideal observer analysis of object recognition.

摘要

著录项

相似文献

相关主题

期刊订阅