首页> 外文会议>ICGCE 2013 >Visual Content Based Clustering of Near Duplicate Web Search Images
【24h】

Visual Content Based Clustering of Near Duplicate Web Search Images

机译:基于Visual Content基于近重复网页搜索图像的聚类

获取原文

摘要

Near-duplicate detection has received substantial attention over the past few years due to applications in copyright enforcement, organizing large image databases, increasing focus in image search, duplication elimination of logos, saving storage space by removing redundancy, etc. In case of document images, near-duplicate detection can be used to increase the efficiency of tagging the documents by reducing the need for manual inspection of the documents. In this paper, an approach is presented to detect near-duplicate images using feature extraction and clustering process. Initially as a preprocessing step, noise removal and image enhancement is done. Image features are used for feature extraction and also for clustering the images. Appropriate similarity measure is used in accordance to the clustering algorithm. Clustering of images is performed which is followed by its evaluation. From the result of evaluation, the clustering process is refined to get better clusters. Each of these clusters will have one image as a representative of that cluster and other images in the cluster is called its near-duplicates. Finally performance measure is calculated for evaluating the algorithm accuracy.
机译:由于版权执法中的应用,组织大型图像数据库,增加了图像搜索,重复消除徽标,通过去除冗余等冗余等焦点,在过去几年中获得了近几年的重复检测。 ,近重复检测可用于通过减少对文档的手动检查的需求来提高标记文档的效率。在本文中,提出一种方法来使用特征提取和聚类过程检测近副本图像。最初作为预处理步骤,完成噪声去除和图像增强。图像特征用于特征提取,也用于聚类图像。根据聚类算法使用适当的相似度量。执行图像的聚类,然后进行评估。从评估结果,聚类过程被精制获得更好的集群。这些群集中的每一个都将具有一个图像作为该群集的代表,并且群集中的其他图像称为近双复制。最后计算性能测量来评估算法精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号