首页> 外文会议>Joint symposium on neural computation >Reading english test with spatially-invariant recptive fields: when the 'binding problem' isn't
【24h】

Reading english test with spatially-invariant recptive fields: when the 'binding problem' isn't

机译:使用空间不变的移除字段阅读英语测试:当“绑定问题”不是

获取原文

摘要

Accumulating experimental evidence suggests that 91) recognition of complex objects and scenes can sometimes occur in a single feedforward pass through the visual system (Fize et al., 1998), and (2) the visual field may be encoded during this first pass in terms of a population of spatially-invariant detectors of visual mini-patterns (Kobatake and Tanaka, 1994). We study here some of the computational properties of visual representations based on spatially-invariant receptive fields (RF's), using the domain of text as a convenient surrogate for visual recognition in general. We begin by developing an analytical model that makes explicit how recognition performance is affected by (1) the number of object categories (workds) that must be distinguished, (2) the complexity of individual objects (length of workds), (3) the number and order of binding of elemental features (letters) included in the representation, and (4) the clutter load, i.e. the amount of visual material (text) in the field of view in which multiple objects must be recognized with-out explicit segmentation. We show that that the model achieves good fits to recognition rates for English text over a wide range of clutter loads, word sizes, and feature counts. We then show, using a quasi-supervised greedy algorithm for feature learning, that fewer than 1,500 mostly low-order, spatially-invariant letter-tuple detectors (akin to Wichelfeatures) are needed to unambiguously represent all the workds simultaneously present in randomly chosen windown tof text up to 50 characters in width. Our results help explain how, and under what conditions, spatially-invariant RF-based representations can process multiple objects simultaneously witout explicit segmentation processes, and they lend support to the notion that representations of this simple kind may underlie important aspects of primate/human recognition.
机译:积累的实验证据表明,91)识别复合物体和场景的有时可以通过视觉系统发生在一个单一的前馈通(Fize等人,1998),和(2)的视场可以在术语此第一遍期间编码的视觉微型图案(Kobatake和Tanaka,1994)在空间上不变的检测器的群体。在这里,我们研究一些基于空间不变的感受野(RF的)的可视化表示的计算性能,使用文本域作为一般的视觉识别方便的替代品。首先,我们开发的分析模型如何识别性能由(1),其必须区分对象类别(workds)的(2)的数量,各个对象的复杂性(workds的长度),(3)的影响,使得明确的数量和包括在表示元素特征(字母)的结合顺序,和(4)的杂波负荷,即,在视场中可视材料(文本)的量,其中,多个对象必须与出明确的分段识别。我们表明,该模型取得了较好的拟合,以识别率的英文文本在很大范围内杂乱的负载,文字大小和功能计数。然后,我们表明,使用地物学习准监督贪婪算法,即少于1500大多低阶的,空间不变信元组检测器(类似于Wichelfeatures)需要明确地表示所有workds同时存在于随机选择windown TOF文本最多50个字符宽。我们的研究结果有助于解释如何,什么条件下,空间不变的基于RF-表示可以处理多个对象同时不用其他明确的分割过程中,他们支持了这一概念,这个简单形式的陈述背后可能灵长类/人认可的重要方面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号