首页> 外文期刊>Discrete optimization >On the combinatorics of the 2-class classification problem
【24h】

On the combinatorics of the 2-class classification problem

机译:关于2级分类问题的组合学

获取原文
获取原文并翻译 | 示例
           

摘要

A set of points X = X-B boolean OR X-R subset of R-d is linearly separable if the convex hulls of X-B and X-R are disjoint, hence there exists a hyperplane separating X-B from X-R. Such a hyperplane provides a method for classifying new points, according to which side of the hyperplane the new points lie. When such a linear separation is not possible, it may still be possible to partition X-B and X-R into prespecified numbers of groups, in such a way that every group from X-B is linearly separable from every group from X-R. We may also discard some points as outliers, and seek to minimize the number of outliers necessary to find such a partition. Based on these ideas, Bertsimas and Shioda proposed the classification and regression by integer optimization (CRIO) method in 2007. In this work we explore the integer programming aspects of the classification part of CRIO, in particular theoretical properties of the associated formulation. We are able to find facet-inducing inequalities coming from the stable set polytope, hence showing that this classification problem has exploitable combinatorial properties. (C) 2018 Elsevier B.V. All rights reserved.
机译:如果X-B和X-R的凸壳不相交,则一组点X = X-B布尔或X-R子集或R-D的X-R子集是线性可分离的。因此,存在来自X-R的超平面分离X-B。这种超平面提供了一种用于对新点进行分类的方法,根据中平面的哪一侧是新的点谎言。当不可能这样的线性分离时,仍然可以将X-B和X-R分配成预定数量的组,使得来自X-B的每个组与来自X-R的每个组线性可分离。我们也可能丢弃一些点作为异常值,并寻求最小化寻找此类分区所必需的异常值的数量。基于这些想法,Bertsimas和ShioIoda在2007年提出了整数优化(CRIO)方法的分类和回归。在这项工作中,我们探讨了CRO的分类部分的整数编程方面,特别是相关配方的理论性质。我们能够找到来自稳定集多孔的方面诱导的不等式,因此显示该分类问题具有可利用的组合性质。 (c)2018 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号