首页> 外文会议>ACM international conference on multimedia >Nearest-Neighbor Classification Using Unlabeled Data for Real World Image Application
【24h】

Nearest-Neighbor Classification Using Unlabeled Data for Real World Image Application

机译:最近的邻邻分类,使用未标记数据进行真实世界图像应用程序

获取原文

摘要

Currently, Nearest-Neighbor approaches (NN) have been widely applied to real world image data mining. These approaches have the following three disadvantages: (ⅰ) the performance is inferior on small datasets; (ⅱ) the performance of approximated nearest neighbor search will degrade for data with high dimensions; (ⅲ) they are heavily dependent on the chosen feature and distance measure. To overcome these intrinsic weaknesses, we propose a novel Nearest-Neighbor method, which improves the original NN approaches from three aspects. Firstly, we propose a novel neighborhood similarity measure, where the similarity between test images and labeled images in the database is calculated jointly by the original image-to-image similarity and the average similarity of their neighboring unlabeled data. Secondly, we adopt the kernelized locality sensitive hashing to effectively conduct the nearest neighbor search for high dimensional data. Finally, to enhance the robustness of the method on different genres of images, we propose to fuse the discrimination power of different features by considering all the retrieved nearest neighbors via hashing systems using different features/kernels. Experimental result shows the advantage over traditional Nearest-Neighbor methods using the labeled data only. Even when the ratio of labeled data is very small, our method could also achieve remarkable results, thanks to the help of unlabeled data and multiple features.
机译:目前,最近邻近的方法(NN)已被广泛应用于现实世界图像数据挖掘。这些方法具有以下三个缺点:(Ⅰ)在小型数据集上的性能低劣; (Ⅱ)近似邻近搜索的性能将降低具有高维度的数据; (Ⅲ)它们严重依赖于所选择的特征和距离测量。为了克服这些内在的弱点,我们提出了一种新的最近邻的方法,其改善了来自三个方面的原始NN方法。首先,我们提出了一种新颖的邻域相似度测量,其中数据库中的测试图像和标记图像之间的相似性通过原始图像到图像相似性和其相邻未标记数据的平均相似性来协同计算。其次,我们采用内核局部敏感散列,以有效地开展最近的邻居搜索高维数据。最后,为了增强在不同类型的不同类型上的方法的鲁棒性,我们建议通过使用不同特征/核的散列系统考虑所有检索到的最近邻居,融合不同特征的辨别力。实验结果显示了使用标记数据的传统最近邻方法的优势。即使当标记数据的比率很小时,由于未标记数据和多个功能的帮助,我们的方法也可以实现显着的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号