首页> 外文期刊>Multimedia Tools and Applications >Using manual and automated annotations to search images by semantic similarity
【24h】

Using manual and automated annotations to search images by semantic similarity

机译:使用手动和自动注释通过语义相似性搜索图像

获取原文
获取原文并翻译 | 示例
           

摘要

Finding semantically similar images is a problem that relies on image annotations manually assigned by amateurs or professionals, or automatically computed by some algorithm using low-level image features. These image annotations create a keyword space where a dissimilarity function quantifies the semantic relationship among images. In this setting, the objective of this paper is two-fold. First, we compare amateur to professional user annotations and propose a model of manual annotation errors, more specifically, an asymmetric binary model. Second, we examine different aspects of search by semantic similarity. More specifically, we study the accuracy of manual annotations versus automatic annotations, the influence of manual annotations with different accuracies as a result of incorrect annotations, and revisit the influence of the keyword space dimensionality. To assess these aspects we conducted experiments on a professional image dataset (Corel) and two amateur image datasets (one with 25,000 Flickr images and a second with 269,648 Flickr images) with a large number of keywords, with different similarity functions and with both manual and automatic annotation methods. We find that Amateur-level manual annotations offers better performance for top ranked results in all datasets (MP@20). However, for full rank measures (MAP) in the real datasets (Flickr) retrieval by semantic similarity with automatic annotations is similar or better than amateur-level manual annotations.
机译:查找语义上相似的图像是一个问题,它依赖于由业余人员或专业人员手动分配的图像注释,或者由某些算法使用低级图像特征自动计算得出的图像注释。这些图像注释创建一个关键字空间,其中差异函数可量化图像之间的语义关系。在这种情况下,本文的目标是双重的。首先,我们将业余用户注释与专业用户注释进行比较,并提出了一个手动注释错误模型,更具体地说,是一种不对称二进制模型。其次,我们通过语义相似性研究了搜索的不同方面。更具体地说,我们研究了手动批注与自动批注的准确性,由于批注不正确而导致具有不同精度的手动批注的影响,并重新研究了关键字空间维度的影响。为了评估这些方面,我们在专业图像数据集(Corel)和两个业余图像数据集(一个包含25,000个Flickr图像,另一个包含269,648个Flickr图像)上进行了实验,这些数据集具有大量关键字,具有相似的相似功能,并且具有手动和自动注释方法。我们发现,业余级别的手动注释可为所有数据集中排名最高的结果(MP @ 20)提供更好的性能。但是,对于真实数据集(Flickr)中的全等级度量(MAP),通过语义相似性进行自动标注的检索比业余级别的手动标注相似或更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号