首页> 外文会议>International Conference on Enterprise Information Systems >TOWARDS HIGH DIMENSIONAL DATA MINING WITH BOOSTING OF PSVM AND VISUALIZATION TOOLS
【24h】

TOWARDS HIGH DIMENSIONAL DATA MINING WITH BOOSTING OF PSVM AND VISUALIZATION TOOLS

机译:通过升压PSVM和可视化工具的高维数据挖掘

获取原文

摘要

We present a new supervised classification algorithm using boosting with support vector machines (SVM) and able to deal with very large data sets. Training a SVM usually needs a quadratic programming, so that the learning task for large data sets requires large memory capacity and a long time. Proximal SVM proposed by Fung and Mangasarian is another SVM formulation very fast to train because it requires only the solution of a linear system. We have used the Sherman-Morrison-Woodbury formula to adapt the PSVM to process data sets with a very large number of attributes. We have extended this idea by applying boosting to PSVM for mining massive data sets with simultaneously very large number of datapoints and attributes. We have evaluated its performance on several large data sets. We also propose a new graphical tool for trying to interpret the results of the new algorithm by displaying the separating frontier between classes of the data set. This can help the user to deeply understand how the new algorithm can work.
机译:我们使用支持向量机(SVM)提升并能够处理非常大的数据集的新监督分类算法。培训SVM通常需要二次编程,因此大数据集的学习任务需要大的内存容量和长时间。近端SVM由FUGG和MAGASARIAR提出的另一个SVM制剂非常快速地训练,因为它只需要线性系统的解决方案。我们使用了Sherman-Morrison-Woodbury公式来调整PSVM以使用大量属性处理数据集。我们通过应用于PSVM来挖掘挖掘大量数据集,同时使用大量的DataPoints和属性来扩展此想法。我们在几种大数据集中评估了其性能。我们还提出了一种新的图形工具,用于通过在数据集的类之间显示分离边防来解释新算法的结果。这可以帮助用户深入了解新算法如何工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号