首页> 外文学位 >Image retrieval based on complex descriptive queries.
【24h】

Image retrieval based on complex descriptive queries.

机译:基于复杂描述性查询的图像检索。

获取原文
获取原文并翻译 | 示例

摘要

The amount of visual data such as images and videos available over web has increased exponentially over the last few years. In order to efficiently organize and exploit these massive collections, a system, apart from being able to answer simple classification based questions such as whether a specific object is present (or absent) in an image, should also be capable of searching images and videos based on more complex descriptive questions. There is also a considerable amount of structure present in the visual world which, if effectively utilized, can help achieve this goal. To this end, we first present an approach for image ranking and retrieval based on queries consisting of multiple semantic attributes. We further show that there are significant correlations present between these attributes and accounting for them can lead to superior performance. Next, we extend this by proposing an image retrieval framework for descriptive queries composed of object categories, semantic attributes and spatial relationships. The proposed framework also includes a unique multi-view hashing technique, which enables query specification in three different modalities---image, sketch and text.;We also demonstrate the effectiveness of leveraging contextual information to reduce the supervision requirements for learning object and scene recognition models. We present an active learning framework to simultaneously learn appearance and contextual models for scene understanding. Within this framework we introduce new kinds of labeling questions that are designed to collect appearance as well as contextual information and which mimic the way in which humans actively learn about their environment. Furthermore we explicitly model the contextual interactions between the regions within an image and select the question which leads to the maximum reduction in the combined entropy of all the regions in the image (image entropy).
机译:在过去的几年中,可通过网络获得的视觉数据(例如图像和视频)数量呈指数增长。为了有效地组织和利用这些庞大的馆藏,该系统除了能够回答基于简单分类的问题(例如图像中是否存在特定对象)外,还应该能够搜索基于图像和视频的图像。关于更复杂的描述性问题。视觉世界中还存在大量结构,如果有效利用它们,可以帮助实现这一目标。为此,我们首先提出一种基于包含多个语义属性的查询的图像排名和检索方法。我们进一步表明,这些属性之间存在显着的相关性,考虑到它们可以导致更高的性能。接下来,我们通过为描述性查询提出一个图像检索框架来扩展此范围,该描述性查询由对象类别,语义属性和空间关系组成。提出的框架还包括一种独特的多视图哈希技术,该技术可以以三种不同的模式(图像,草图和文本)进行查询规范;我们还展示了利用上下文信息来减少对学习对象和场景的监管要求的有效性识别模型。我们提出了一个主动的学习框架,可以同时学习外观和上下文模型以进行场景理解。在此框架内,我们引入了新的标签问题,这些标签问题旨在收集外观和上下文信息,并模仿人类积极了解其环境的方式。此外,我们显式地对图像内区域之间的上下文交互进行建模,并选择问题,该问题导致图像中所有区域的组合熵(图像熵)最大降低。

著录项

  • 作者

    Siddiquie, Behjat.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 146 p.
  • 总页数 146
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号