首页> 外文期刊>Electronic Notes in Theoretical Computer Science >Minimization of Gini Impurity: NP-completeness and Approximation Algorithm via Connections with the k-means Problem
【24h】

Minimization of Gini Impurity: NP-completeness and Approximation Algorithm via Connections with the k-means Problem

机译:基尼杂质的最小化:与k-means问题有关的NP完整性和近似算法

获取原文
获取外文期刊封面目录资料

摘要

The Gini impurity is a very popular criterion to select attributes during decision trees construction. In the problem of finding a partition with minimum weighted Gini impurity (PMWGP), the one faced during the construction of decision trees, a set of vectors must be partitioned intokdifferent clusters such that the partition's overall Gini impurity is minimized.We show thatPMWGPis APX-hard for arbitrarykand admits a randomized PTAS when the number of clusters is fixed. These results significantly improve the current knowledge on the problem. The key idea to obtain these results is to explore connections betweenPMWGPand the geometrick-means clustering problem.
机译:基尼杂质是在决策树构建过程中选择属性的一种非常流行的标准。在寻找具有最小加权基尼杂质(PMWGP)的分区的问题时,决策树的构建过程中面临着一个问题,必须将一组矢量划分为不同的簇,以使该分区的整体基尼杂质最小化。我们证明了PMWGPis APX-当簇数固定时,很难对任意k接受随机PTAS。这些结果大大提高了当前有关该问题的知识。获得这些结果的关键思想是探索PMWGP和geometrick-means聚类问题之间的联系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号