首页> 外文OA文献 >Natural scene classification, annotation and retrieval. Developing different approaches for semantic scene modelling based on Bag of Visual Words.
【2h】

Natural scene classification, annotation and retrieval. Developing different approaches for semantic scene modelling based on Bag of Visual Words.

机译:自然场景分类,注释和检索。开发基于视觉单词袋的语义场景建模的不同方法。

摘要

With the availability of inexpensive hardware and software, digital imaging has become an important medium of communication in our daily lives. A huge amount of digital images are being collected and become available through the internet and stored in various fields such as personal image collections, medical imaging, digital arts etc. Therefore, it is important to make sure that images are stored, searched and accessed in an efficient manner. The use of bag of visual words (BOW) model for modelling images based on local invariant features computed at interest point locations has become a standard choice for many computer vision tasks. Based on this promising model, this thesis investigates three main problems: natural scene classification, annotation and retrieval. Given an image, the task is to design a system that can determine to which class that image belongs to (classification), what semantic concepts it contain (annotation) and what images are most similar to (retrieval).udThis thesis contributes to scene classification by proposing a weighting approach, named keypoints density-based weighting method (KDW), to control the fusion of colour information and bag of visual words on spatial pyramid layout in a unified framework. Different configurations of BOW, integrated visual vocabularies and multiple image descriptors are investigated and analyzed. The proposed approaches are extensively evaluated over three well-known scene classification datasets with 6, 8 and 15 scene categories using 10-fold cross validation. The second contribution in this thesis, the scene annotation task, is to explore whether the integrated visual vocabularies generated for scene classification can be used to model the local semantic information of natural scenes. In this direction, image annotation is considered as a classification problem where images are partitioned into 10x10 fixed grid and each block, represented by BOW and different image descriptors, is classified into one of predefined semantic classes. An image is then represented by counting the percentage of every semantic concept detected in the image. Experimental results on 6 scene categories demonstrate the effectiveness of the proposed approach. Finally, this thesis further explores, with an extensive experimental work, the use of different configurations of the BOW for natural scene retrieval.
机译:随着廉价硬件和软件的出现,数字成像已成为我们日常生活中一种重要的沟通手段。大量的数字图像正在被收集并可以通过互联网使用,并存储在各个领域,例如个人图像收集,医学成像,数字艺术等。因此,确保图像的存储,搜索和访问非常重要。有效的方式。使用视觉单词袋(BOW)模型基于在兴趣点位置计算出的局部不变特征对图像进行建模已成为许多计算机视觉任务的标准选择。基于这种有希望的模型,本文研究了三个主要问题:自然场景分类,注释和检索。给定图像,任务是设计一个系统,该系统可以确定图像所属的类(分类),图像所包含的语义概念(注释)以及最类似于图像的图像(检索)。 ud通过提出一种加权方法(称为基于密度的关键点加权方法(KDW))来进行分类,以在统一框架中控制颜色信息和视觉词袋在空间金字塔布局上的融合。研究和分析了BOW的不同配置,集成的视觉词汇和多个图像描述符。所提出的方法使用10倍交叉验证在具有6、8和15个场景类别的三个知名场景分类数据集上进行了广泛评估。本文的第二个贡献是场景注释任务,目的是探索为场景分类而生成的综合视觉词汇是否可用于对自然场景的局部语义信息进行建模。在这个方向上,图像注释被认为是一种分类问题,其中图像被划分为10x10固定网格,并且每个由BOW和不同图像描述符表示的块被分类为预定义的语义类之一。然后,通过对图像中检测到的每个语义概念的百分比进行计数来表示图像。在6个场景类别上的实验结果证明了该方法的有效性。最后,本文通过广泛的实验工作进一步探索了BOW的不同配置在自然场景检索中的使用。

著录项

  • 作者

    Alqasrawi Yousef T. N.;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号