首页> 外文会议>International Conference on Data Management Technologies and Applications >Incorporating Feature Selection and Clustering Approaches for High-Dimensional Data Reduction
【24h】

Incorporating Feature Selection and Clustering Approaches for High-Dimensional Data Reduction

机译:结合特征选择和聚类方法,可实现高维数据减少

获取原文

摘要

Data reduction is an important research topic for analyzing mass data efficiently and effectively in the era of big data. The task of dimension reduction is usually accomplished by technologies of feature selection, feature clustering or algebraic transformation. A novel approach for reducing high-dimensional data is initiated in this paper. The main idea of the proposed scheme is to incorporate data clustering and feature selection to transform high-dimensional data into lower dimensions. The incremental clustering algorithm in the scheme is used to handle the number of dimensions, and the relative discriminant variable is design for selecting significant features. Finally, a simple inner product operation is applied to transform original highdimensional data into a low one. Evaluations are conducted by testing the reduction approach on the problem of document categorization. The experimental results show that the reduced data have high classification accuracy for most of datasets. For some special datasets, the reduced data can get higher classification accuracy in comparison with original data.
机译:数据减少是有效,有效地在大数据时代分析质量数据的重要研究主题。尺寸减少的任务通常是通过特征选择,特征聚类或代数转换的技术完成的。本文启动了一种降低高维数据的新方法。所提出的方案的主要思想是纳入数据聚类和特征选择以将高维数据转换为较低的维度。方案中的增量聚类算法用于处理维度的数量,相对判别变量是用于选择有效特征的设计。最后,应用简单的内部产品操作以将原始高度数据转换为低电平。通过测试文件分类问题的减少方法进行评估。实验结果表明,减少数据对大多数数据集的高分类精度。对于某些特殊数据集,减少数据可以获得与原始数据相比的分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号