首页> 外文会议>Industrial conference on data mining >A Novel Sparsity Based Classification Framework to Exploit Clusters in Data
【24h】

A Novel Sparsity Based Classification Framework to Exploit Clusters in Data

机译:基于稀疏性的新型分类框架,用于数据簇的挖掘

获取原文

摘要

A huge recent advance in machine learning has been the usage of sparsity as a guiding principle to perform classification. Traditionally sparsity has been used to exploit a property of high dimensional vectors-which is that, vectors of the same class lie on or near the same low dimensional subspace within an ambient high dimensional space-this is seen in algorithms like Basis Pursuit, and Sparse Representation classifier. In this paper we use sparsity to exploit a different property of data, which is that data points belonging to the same class constitute a cluster. Here classification is done by determining which cluster's vectors can best convexly approximate the given test vector. So if the vectors of cluster 'A' best approximate or realise the given test vector, then label 'A' is assigned as its class. The problem of finding the best approximate is framed as a t norm minimization problem with convex constraints. The optimization framework of the proposed algorithm is convex in nature making the classification algorithm tractable. The proposed algorithm is evaluated by comparing its accuracy with the accuracy of other popular machine learning algorithms on a diverse collection of real datasets. The proposed algorithm on an average provides a 10 % improvement in accuracy over certain standard machine learning algorithms.
机译:机器学习方面最近的一项巨大进步是使用稀疏性作为执行分类的指导原则。传统上,稀疏性已被用于开发高维向量的属性-即,相同类别的向量位于环境高维空间内的同一低维子空间上或附近-这在诸如Basis Pursuit和Sparse之类的算法中可以看到表示分类器。在本文中,我们使用稀疏性来利用数据的不同属性,即属于同一类的数据点构成一个群集。这里的分类是通过确定哪个簇的矢量可以最佳地凸近似给定的测试矢量来完成的。因此,如果群集“ A”的向量最接近或实现了给定的测试向量,则将标签“ A”指定为其类别。找到最佳近似值的问题被构造为具有凸约束的t \范数最小化问题。所提出算法的优化框架本质上是凸的,从而使分类算法易于处理。通过在各种真实数据集上将其算法的准确性与其他流行的机器学习算法的准确性进行比较来评估该算法。与某些标准的机器学习算法相比,所提出的算法平均可将精度提高10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号