...
首页> 外文期刊>Machine Learning Research >Unsupervised Dimensionality Reduction for High-Dimensional Data Classification
【24h】

Unsupervised Dimensionality Reduction for High-Dimensional Data Classification

机译:高维数据分类的无监督降维

获取原文

摘要

This paper carries on research surrounding the influences produced by dimensionality reduction on machine learning classification effect. Firstly, paper constructs the analysis architecture of data dimension reduction classification, combines the two different unsupervised dimension reduction methods, locally linear embedding (LLE) and principal component analysis (PCA) with the five machine learning classification methods: Gradient Boosting Decision Tree (GBDT), Random Forest, Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Logistic Regression. And then uses the handwritten digital identification dataset to analyze the classification performance of these five classification methods on different dimension datasets by different dimensionality reduction methods. The analysis shows that using the appropriate dimensionality reduction method for dimensionality reduction classification can effectively improve the classification accuracy; the dimensionality reduction classification effect of non-linear dimensionality reduction method is generally better than the linear dimensionality reduction method; different machine learning classification algorithms have significant differences in the sensitivity of dimensions.
机译:本文围绕降维对机器学习分类效果的影响进行了研究。首先,本文构建了数据降维分类的分析架构,将两种不同的无监督降维方法(局部线性嵌入(LLE)和主成分分析(PCA))与五种机器学习分类方法相结合:梯度提升决策树(GBDT) ,随机森林,支持向量机(SVM),K最近邻(KNN)和Logistic回归。然后使用手写数字识别数据集,通过不同的降维方法,分析了这五种分类方法在不同维度数据集上的分类性能。分析表明,采用适当的降维方法进行降维分类可以有效提高分类精度。非线性降维方法的降维分类效果一般要优于线性降维方法。不同的机器学习分类算法在尺寸敏感性上有显着差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号