首页> 外文期刊>Pattern recognition letters >Multiple instance learning based on positive instance selection and bag structure construction
【24h】

Multiple instance learning based on positive instance selection and bag structure construction

机译:基于积极实例选择和包结构构建的多实例学习

获取原文
获取原文并翻译 | 示例
           

摘要

Previous studies on multiple instance learning (MIL) have shown that the MIL problem holds three characteristics: positive instance clustering, bag structure and instance probabilistic influence to bag label. In this paper, combined with the advantages of these three characteristics, we propose two simple yet effective MIL algorithms, CK_MIL and CK_MIL. We take three steps to convert MIL to a standard supervised learning problem. In the first step, we perform K-means clustering algorithm on the positive and negative sets separately to obtain the cluster centers, further use them to select the most positive instances in bags. Next, we combine three distances, including the maximum, minimum and the average distances from bag to cluster centers, as bag structure. For CK_MIL, we simply compose the positive instance and bag structure to form a new vector as bag representation, then apply RBF kernel to measure bag similarity, while for ck_MIL algorithm we construct a new kernel by introducing a probabilistic coefficient to balance the influences between the positive instance similarity and bag structure similarity. As a result, the MIL problem is converted to a standard supervised learning problem that can be solved directly by SVM method. Experiments on MUSK and COREL image set have shown that our two algorithms perform better than other key existing MIL algorithms on the drug prediction and image classification tasks.
机译:先前对多实例学习(MIL)的研究表明,MIL问题具有三个特征:积极的实例聚类,袋子结构和实例对袋子标签的概率影响。在本文中,结合这三个特性的优点,我们提出了两种简单而有效的MIL算法,即CK_MIL和CK_MIL。我们采取三个步骤将MIL转换为标准的监督学习问题。第一步,我们分别对正集和负集执行K-means聚类算法以获得聚类中心,进一步使用它们来选择袋中最正的实例。接下来,我们结合三个距离,包括袋子到群集中心的最大,最小和平均距离,作为袋子结构。对于CK_MIL,我们简单地组合正实例和bag结构以形成一个新的向量作为bag表示,然后应用RBF内核来测量bag的相似度,而对于ck_MIL算法,我们通过引入一个概率系数来平衡两个变量之间的影响来构造一个新内核。正实例相似性和包结构相似性。结果,MIL问题被转换为可以通过SVM方法直接解决的标准监督学习问题。在MUSK和COREL图像集上进行的实验表明,在药物预测和图像分类任务上,我们的两种算法的性能优于现有的其他关键MIL算法。

著录项

  • 来源
    《Pattern recognition letters》 |2014年第15期|19-26|共8页
  • 作者单位

    School of Information Science and Technology, Northwest University, Xi'an 710069, China;

    School of Information Science and Technology, Northwest University, Xi'an 710069, China;

    School of Information Science and Technology, Northwest University, Xi'an 710069, China;

    School of Information Science and Technology, Northwest University, Xi'an 710069, China;

    School of Information Science and Technology, Northwest University, Xi'an 710069, China;

    School of Computer Science and Engineering and School of Automation and Information Engineering, Xi'an University of Technology, Xi'an 710048, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multiple instance learning (MIL); Support vector machine (SVM); K-means clustering; Multiple kernel;

    机译:多实例学习(MIL);支持向量机(SVM);K-均值聚类;多核;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号