首页> 外文学位 >Combining Visual Features and Contextual Information for Image Retrieval and Annotation.
【24h】

Combining Visual Features and Contextual Information for Image Retrieval and Annotation.

机译:结合视觉特征和上下文信息进行图像检索和注释。

获取原文
获取原文并翻译 | 示例

摘要

This thesis is primarily focused on the information combination at different levels of a statistical pattern classification framework for image annotation and retrieval. Based on the previous study within the fields of image annotation and retrieval, it has been well-recognized that the low-level visual features, such as color and texture, and high-level features, such as textual description and context, are distinct yet complementary in terms of their distributions and the corresponding discriminative powers for dealing with machine-based recognition and retrieval tasks. Therefore, effective feature combination for image annotation and retrieval has become a desirable and promising perspective from which the semantic gap can be further bridged. Motivated by this fact, the combination of the visual and context modalities and that of different features in the visual domain are tackled by developing two statistical pattern classification approaches considering that the features of the visual modality and those across different modalities exhibit different degrees of heterogeneities, and thus, should be treated differently. Regarding the cross-modality feature combination, a Bayesian framework is proposed to integrate visual content and context, which has been applied to various image annotation and retrieval frameworks. In terms of the combination of different low-level features in the visual domain, the problem is tackled with a novel method that combines texture and color features via a mixture model of their joint distribution. To evaluate the proposed frameworks, many different datasets are employed in the experiments, including the COREL database for image retrieval and the MSRC, LabelMe, PASCAL VOC2009, and an animal image database collected by ourselves for image annotation. Using various evaluation criteria, the first framework is shown to be more effective than the methods purely based on the low-level features or high-level context. As for the second, the experimental results demonstrate not only its superior performance to other feature combination methods but also its ability to discover visual clusters using texture and color simultaneously. Moreover, a demo search engine based on the Bayesian framework is implemented and available online.
机译:本文主要研究用于图像注释和检索的统计模式分类框架不同层次的信息组合。根据先前在图像注释和检索领域的研究,人们已经认识到,低级视觉特征(例如颜色和纹理)和高阶特征(例如文本描述和上下文)是截然不同的在它们的分布和用于处理基于机器的识别和检索任务的相应判别能力方面具有互补性。因此,用于图像注释和检索的有效特征组合已经成为可望和有希望的观点,从该观点可以进一步弥合语义鸿沟。出于这一事实,考虑到视觉模态和跨不同模态的特征表现出不同程度的异质性,开发两种统计模式分类方法可以解决视觉和上下文模态以及视觉领域中不同特征的组合问题,因此,应区别对待。关于跨模态特征的组合,提出了一种贝叶斯框架来整合视觉内容和上下文,并已应用于各种图像标注和检索框架。就视觉领域中不同的低级特征的组合而言,该问题通过一种新颖的方法来解决,该方法通过其联合分布的混合模型将纹理和颜色特征组合在一起。为了评估提出的框架,实验中使用了许多不同的数据集,包括用于图像检索的COREL数据库和MSRC,LabelMe,PASCAL VOC2009,以及我们自己收集的用于图像注释的动物图像数据库。通过使用各种评估标准,第一个框架比纯基于低层功能或高层上下文的方法更有效。至于第二个,实验结果表明,它不仅具有优于其他特征组合方法的性能,而且还具有同时使用纹理和颜色发现视觉集群的能力。此外,基于贝叶斯框架的演示搜索引擎已实现并可以在线获得。

著录项

  • 作者

    Zhang, Rui.;

  • 作者单位

    Ryerson University (Canada).;

  • 授予单位 Ryerson University (Canada).;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 197 p.
  • 总页数 197
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:44:20

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号