首页> 外文学位 >Finding Birds in Trees: Building Categories from Image Streams.
【24h】

Finding Birds in Trees: Building Categories from Image Streams.

机译:在树上找鸟:从图像流构建类别。

获取原文
获取原文并翻译 | 示例

摘要

We explore automated object detection and categorization in image sequences within the context of natural environments . Inherent in these environments are significant challenges to be modeled---for example, complex texture, background motion, and object mimicry. We present a general background model that is applicable to natural scenes. Our approach models the underlying warping of pixel locations arising from background motion. The background is modeled as a set of warping layers where, at any given time, different layers may be visible due to the motion of an occluding layer. Foreground regions are thus defined as those that cannot be modeled by some composition of some warping of these background layers. We illustrate this concept by first reducing the possible warps to those where the pixels are restricted to displacements within a spatial neighborhood, and then learning the appropriate size of that spatial neighborhood. Then, we show how changes in intensity/color histograms of pixel neighborhoods can be used to discriminate foreground and background regions. We find that this approach compares favorably with the state of the art, while requiring less computation.;We have designed and implemented a system for cataloging putative objects of interest into viewable clusters from an image sequence and user input. We introduce two object representations. One is a set of feature histograms, each corresponding to a viewpoint of the object. The other is an object barcode that represents whether or not a feature is present across all views. The approach is unbiased towards redundant views---that is, it does not matter how many times an object appears from the same viewpoint. At the same time, the approach does not penalize for missing views---so that successful object categorization does not require capturing all viewpoints. We use these representations to cluster objects into viewable clusters that users can label according to the categories of their interest. We then feed these labels back into the system to automatically label new objects that appear in the image sequence. We find that the system significantly reduces the amount of time users would spend looking at uninformative images.
机译:我们探索自然环境中图像序列中的自动对象检测和分类。在这些环境中固有的是要建模的重大挑战,例如,复杂的纹理,背景运动和对象模仿。我们提出了适用于自然场景的一般背景模型。我们的方法模拟了由背景运动引起的像素位置的潜在变形。背景被建模为一组变形层,其中在任何给定时间,由于遮挡层的运动,不同的层可能可见。因此,将前景区域定义为无法通过这些背景层的某些变形的某些组合来建模的区域。我们首先通过将可能的扭曲减少到像素受限于空间邻域内的位移的那些扭曲,然后学习该空间邻域的适当大小来说明此概念。然后,我们展示如何使用像素邻域的强度/颜色直方图的变化来区分前景和背景区域。我们发现,该方法与现有技术相比具有优势,同时所需的计算量更少。我们已经设计并实现了一种系统,该系统可根据图像序列和用户输入将感兴趣的假定对象分类为可见的群集。我们介绍两种对象表示形式。一个是一组特征直方图,每个特征直方图对应于对象的视点。另一个是对象条形码,它表示所有视图中是否都存在功能。该方法无偏向于冗余视图,也就是说,从同一视点来看一个对象出现多少次并不重要。同时,该方法不会因缺少视图而受到惩罚,因此成功的对象分类不需要捕获所有视点。我们使用这些表示将对象聚类为可见的聚类,用户可以根据自己感兴趣的类别对其进行标记。然后,我们将这些标签反馈回系统,以自动标记出现在图像序列中的新对象。我们发现,该系统大大减少了用户花费在查看无信息图像上的时间。

著录项

  • 作者

    Ko, Teresa H.;

  • 作者单位

    University of California, Los Angeles.;

  • 授予单位 University of California, Los Angeles.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 108 p.
  • 总页数 108
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号