首页> 外文期刊>Neural computation >SEEMORE: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to Visual Object Recognition
【24h】

SEEMORE: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to Visual Object Recognition

机译:SEEMORE:在自然启发的视觉对象识别方法中结合颜色,形状和纹理直方图

获取原文
获取原文并翻译 | 示例

摘要

Severe architectural and timing constraints within the primate visual system support the conjecture that the early phase of object recognition in the brain is based on a feedforward feature-extraction hierarchy. To assess the plausibility of this conjecture in an engineering context, a difficult three-dimensional object recognition domain was developed to challenge a pure feedforward, receptive-field based recognition model called SEEMORE. SEEMORE is based on 102 viewpoint-invariant nonlinear filters that as a group are sensitive to contour, texture, and color cues. The visual domain consists of 100 real objects of many different types, including rigid (shovel), nonrigid (telephone cord), and statistical (maple leaf cluster) objects and photographs of complex scenes. Objects were in dividually presented in color video images under normal room lighting conditions. Based on 12 to 36 training views, SEEMORE was required to recognize unnormalized test views of objects that could vary in position, orientation in the image plane and in depth, and scale (factor of 2); for non rigid objects, recognition was also tested under gross shape deformations. Correct classification performance on a test set consisting of 600 novel object views was 97 percent (chance was 1 percent) and was comparable for the subset of 15 nonrigid objects. Performance was also measured under a variety of image degradation conditions, including partial occlusion, limited clutter, color shift, and additive noise. Generalization behavior and classification errors illustrate the emergence of several striking natural shape categories that are not explicitly encoded in the dimensions of the feature space. It is concluded that in the light of the vast hardware resources available in the ventral stream of the primate visual system relative to those exercised here, the appealingly simple feature-space conjecture remains worthy of serious consideration as a neurobiological model.
机译:灵长类动物视觉系统中严格的架构和时序约束支持这样的推测,即大脑中对象识别的早期阶段是基于前馈特征提取层次结构的。为了在工程环境中评估这种推测的合理性,开发了一个困难的三维物体识别域,以挑战称为SEEMORE的纯前馈,基于接收场的识别模型。 SEEMORE基于102个视点不变的非线性滤波器,这些滤波器作为一个整体对轮廓,纹理和颜色提示敏感。视域由100种不同类型的真实对象组成,包括刚性(铲子),非刚性(电话线)和统计(枫叶簇)对象以及复杂场景的照片。在正常的室内照明条件下,对象分别以彩色视频图像呈现。根据12到36个训练视图,需要SEEMORE来识别对象的非规范化测试视图,这些对象的位置,图像平面上的方向以及深度和比例(因子2)可能有所不同;对于非刚性物体,还对总体形状变形下的识别性进行了测试。在由600个新颖的对象视图组成的测试集上,正确的分类性能为97%(机会为1%),并且与15个非刚性对象的子集相当。还可以在各种图像质量下降的条件下测量性能,包括部分遮挡,有限的杂波,色偏和加性噪点。泛化行为和分类错误说明了几种醒目的自然形状类别的出现,这些类别未在特征空间的维度中明确编码。结论是,鉴于灵长类动物视觉系统腹侧流相对于此处行使的硬件资源可利用的大量硬件资源,吸引人的简单特征空间猜想仍然值得作为神经生物学模型加以认真考虑。

著录项

  • 来源
    《Neural computation》 |1997年第4期|777-804|共28页
  • 作者

    Mel B;

  • 作者单位

    Department of Biomedical Engineering, University of Southern California, Los Angeles, California 90089 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号