首页> 外文学位 >Semantics and aesthetics inference for image search: Statistical learning approaches.
【24h】

Semantics and aesthetics inference for image search: Statistical learning approaches.

机译:图像搜索的语义和美学推理:统计学习方法。

获取原文
获取原文并翻译 | 示例

摘要

The automatic inference of image semantics is an important but highly challenging research problem whose solutions can greatly benefit content-based image search and automatic image annotation. In this thesis, I present algorithms and statistical models for inferring image semantics and aesthetics from visual content, specifically aimed at improving real-world image search. First, a novel approach to automatic image tagging is presented which furthers the state-of-the-art in both speed and accuracy. The direct use of automatically generated tags in real-world image search is then explored, and its efficacy demonstrated experimentally. An assumption which makes most annotation models misrepresent reality is that the state of the world is static, whereas it is fundamentally dynamic. I explore learning algorithms for adapting automatic tagging to different scenario changes. Specifically, a meta-learning model is proposed which can augment a black-box annotation model to help provide adaptability for personalization, time evolution, and contextual changes. Instead of retraining expensive annotation models, adaptability is achieved through efficient incremental learning of only the meta-learning component. Large scale experiments convincingly support this approach. In image search, when semantics alone yields many matches, one way to rank images further is to look beyond semantics and consider visual quality. I explore the topic of data-driven inference of aesthetic quality of images. Owing to minimal prior art, the topic is first explored in detail. Then, methods for extracting a number of high-level visual features, presumed to have correlation with aesthetics, are presented. Through feature selection and machine learning, an aesthetics inference model is trained and found to perform moderately on real-world data. The aesthetics-correlated visual features are then used in the problem of selecting and eliminating images at the high and low extremes of the aesthetics scale respectively, using a novel statistical model. Experimentally, this approach is found to work well in visual quality based filtering. Finally, I explore the use of image search techniques for designing a novel image-based CAPTCHA, a Web security test aimed at distinguishing humans from machines. Assuming image search metrics to be potential attack tools, they are used in the loop to design attack-resistant CAPTCHAs.
机译:图像语义的自动推断是一个重要但极富挑战性的研究问题,其解决方案可以极大地有益于基于内容的图像搜索和自动图像标注。在本文中,我提出了从视觉内容中推断图像语义和美感的算法和统计模型,专门针对改进现实世界中的图像搜索。首先,提出了一种新颖的自动图像标记方法,该方法在速度和准确性上都达到了最新水平。然后探讨了自动生成的标签在真实世界图像搜索中的直接使用,并通过实验证明了其有效性。使大多数注释模型无法正确反映现实的假设是,世界状态是静态的,而从根本上讲是动态的。我探索了用于使自动标记适应不同场景变化的学习算法。具体而言,提出了一种元学习模型,该模型可以增强黑匣子注释模型,以帮助提供针对个性化,时间演变和上下文变化的适应性。通过仅对元学习组件进行有效的增量学习,可以实现适应性,而不是重新训练昂贵的注释模型。大规模实验令人信服地支持这种方法。在图像搜索中,当仅语义会产生许多匹配项时,对图像进行进一步排名的一种方法是超越语义并考虑视觉质量。我探讨了图像美学质量的数据驱动推断主题。由于现有技术最少,因此首先对该主题进行了详细探讨。然后,提出了一些提取的高级视觉特征的方法,这些方法被认为与美学具有相关性。通过特征选择和机器学习,对美学推理模型进行了训练,并发现其对真实数据的表现适中。然后,使用新颖的统计模型,将与美学相关的视觉特征分别用于选择和消除美学尺度高低端图像的问题。实验上,发现这种方法在基于视觉质量的过滤中效果很好。最后,我探索了图像搜索技术在设计基于图像的新型CAPTCHA方面的用途,CAPTCHA是一种旨在区分人与机器的Web安全测试。假设图像搜索指标是潜在的攻击工具,它们将在循环中用于设计耐攻击的验证码。

著录项

  • 作者

    Datta, Ritendra.;

  • 作者单位

    The Pennsylvania State University.;

  • 授予单位 The Pennsylvania State University.;
  • 学科 Artificial Intelligence.;Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 263 p.
  • 总页数 263
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:37:47

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号