首页> 外文期刊>Machine Learning >A theoretical and empirical analysis of support vector machine methods for multiple-instance classification
【24h】

A theoretical and empirical analysis of support vector machine methods for multiple-instance classification

机译:支持向量机的多实例分类理论与实证分析

获取原文
获取原文并翻译 | 示例
       

摘要

The standard support vector machine (SVM) formulation, widely used for supervised learning, possesses several intuitive and desirable properties. In particular, it is convex and assigns zero loss to solutions if, and only if, they correspond to consistent classifying hyperplanes with some nonzero margin. The traditional SVM formulation has been heuristically extended to multiple-instance (MI) classification in various ways. In this work, we analyze several such algorithms and observe that all MI techniques lack at least one of the desirable properties above. Further, we show that this tradeoff is fundamental, stems from the topological properties of consistent classifying hyperplanes for MI data, and is related to the computational complexity of learning MI hyperplanes. We then study the empirical consequences of this three-way tradeoff in MI classification using a large group of algorithms and datasets. We find that the experimental observations generally support our theoretical results, and properties such as the labeling task (instance versus bag labeling) influence the effects of different tradeoffs.
机译:被广泛用于监督学习的标准支持向量机(SVM)公式具有一些直观和理想的属性。特别是,它是凸的,并且仅当它们对应于具有某些非零裕度的一致分类超平面时,才为解决方案分配零损失。传统的SVM公式已通过各种方式试探性地扩展到多实例(MI)分类。在这项工作中,我们分析了几种这样的算法,并观察到所有MI技术都缺乏上述理想特性中的至少一种。此外,我们表明,这种折衷是基本的,源于对MI数据进行一致分类的超平面的拓扑特性,并且与学习MI超平面的计算复杂性有关。然后,我们使用大量算法和数据集研究MI分类中此三向折衷的经验结果。我们发现实验观察通常支持我们的理论结果,并且诸如标记任务(实例与袋标记)之类的属性会影响不同权衡的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号