首页> 外文期刊>ACM Transactions on Information Systems >Semisupervised SVM Batch Mode Active Learning with Applications to Image Retrieval
【24h】

Semisupervised SVM Batch Mode Active Learning with Applications to Image Retrieval

机译:半监督SVM批处理模式主动学习及其在图像检索中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

Support vector machine (SVM) active learning is one popular and successful technique for relevance feedback in content-based image retrieval (CBIR). Despite the success, conventional SVM active learning has two main drawbacks. First, the performance of SVM is usually limited by the number of labeled examples. It often suffers a poor performance for the small-sized labeled examples, which is the case in relevance feedback. Second, conventional approaches do not take into account the redundancy among examples, and could select multiple examples that are similar (or even identical). In this work, we propose a novel scheme for explicitly addressing the drawbacks. It first learns a kernel function from a mixture of labeled and unlabeled data, and therefore alleviates the problem of small-sized training data. The kernel will then be used for a batch mode active learning method to identify the most informative and diverse examples via a min-max framework. Two novel algorithms are proposed to solve the related combinatorial optimization: the first approach approximates the problem into a quadratic program, and the second solves the combinatorial optimization approximately by a greedy algorithm that exploits the merits of submodular functions. Extensive experiments with image retrieval using both natural photo images and medical images show thatrnthe proposed algorithms are significantly more effective than the state-of-the-art approaches.
机译:支持向量机(SVM)主动学习是一种基于内容的图像检索(CBIR)中相关性反馈的流行且成功的技术。尽管取得了成功,但是传统的SVM主动学习有两个主要缺点。首先,SVM的性能通常受到标记示例的数量的限制。对于小尺寸的带标签示例,其性能通常很差,相关性反馈就是这种情况。其次,常规方法没有考虑示例之间的冗余,而是可以选择多个相似(甚至相同)的示例。在这项工作中,我们提出了一种新颖的方案来明确解决这些缺点。它首先从标记和未标记数据的混合中学习核函数,从而减轻了训练数据量小的问题。然后,该内核将用于批处理模式主动学习方法,以通过min-max框架识别最有用和最多样化的示例。提出了两种新颖的算法来解决相关的组合优化问题:第一种方法将问题近似为一个二次程序,第二种通过贪婪算法近似地解决组合优化问题,该算法利用了子模块功能的优点。使用自然照片图像和医学图像进行图像检索的大量实验表明,所提出的算法比最新方法有效得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号