Unsupervised Dimensionality Reduction for High-Dimensional Data Classification

Hany Yan; Hu Tianyu

首页> 外文期刊>Machine Learning Research >Unsupervised Dimensionality Reduction for High-Dimensional Data Classification

【24h】

Unsupervised Dimensionality Reduction for High-Dimensional Data Classification

机译：高维数据分类的无监督降维

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper carries on research surrounding the influences produced by dimensionality reduction on machine learning classification effect. Firstly, paper constructs the analysis architecture of data dimension reduction classification, combines the two different unsupervised dimension reduction methods, locally linear embedding (LLE) and principal component analysis (PCA) with the five machine learning classification methods: Gradient Boosting Decision Tree (GBDT), Random Forest, Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Logistic Regression. And then uses the handwritten digital identification dataset to analyze the classification performance of these five classification methods on different dimension datasets by different dimensionality reduction methods. The analysis shows that using the appropriate dimensionality reduction method for dimensionality reduction classification can effectively improve the classification accuracy; the dimensionality reduction classification effect of non-linear dimensionality reduction method is generally better than the linear dimensionality reduction method; different machine learning classification algorithms have significant differences in the sensitivity of dimensions.

机译：本文围绕降维对机器学习分类效果的影响进行了研究。首先，本文构建了数据降维分类的分析架构，将两种不同的无监督降维方法（局部线性嵌入（LLE）和主成分分析（PCA））与五种机器学习分类方法相结合：梯度提升决策树（GBDT），随机森林，支持向量机（SVM），K最近邻（KNN）和Logistic回归。然后使用手写数字识别数据集，通过不同的降维方法，分析了这五种分类方法在不同维度数据集上的分类性能。分析表明，采用适当的降维方法进行降维分类可以有效提高分类精度。非线性降维方法的降维分类效果一般要优于线性降维方法。不同的机器学习分类算法在尺寸敏感性上有显着差异。

著录项

来源
《Machine Learning Research 》 |2017年第4期| 共8页
作者
Hany Yan; Hu Tianyu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. Estimating gene expression from high-dimensional DNA methylation levels in cancer data: A bimodal unsupervised dimension reduction algorithm [J] . Damgacioglu Haluk, Celik Emrah, Celik Nurcin Computers & Industrial Engineering . 2019 ,第APRa期

机译：从癌症数据中的高维DNA甲基化水平估算基因表达：一种双峰无监督降维算法
2. UNSUPERVISED ADAPTATION FOR HIGH-DIMENSIONAL WITH LIMITED-SAMPLE DATA CLASSIFICATION USING VARIATIONAL AUTOENCODER [J] . Mahmud Mohammad Sultan, Huang Joshua Zhexue, Fu Xianghua, Computing and informatics . 2021 ,第1期

机译：使用变化性AutiaceCoder使用有限 - 样本数据分类无监督适应性的高维度
3. Unsupervised Linear Feature-Extraction Methods and Their Effects in the Classification of High-Dimensional Data [J] . Jimenez-Rodriguez L. O., Arzuaga-Cruz E., Velez-Reyes M. IEEE Transactions on Geoscience and Remote Sensing . 2007 ,第期

机译：无监督线性特征提取方法及其在高维数据分类中的作用
4. A Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-Dimensional Visual Data [C] . Liu Lingqiao, Wang Lei IEEE International Conference on Computer Vision . 2013

机译：高维可视数据有效降维的可扩展无监督特征合并方法
5. Perturbed neural network backpropagation learning and adaptive wavelets for dimension reduction for improved classification of high-dimensional datasets. [D] . Bosch, Edward H. 2005

机译：扰动神经网络的反向传播学习和自适应小波用于降维，以改进高维数据集的分类。
6. Unsupervised discovery of temporal sequences in high-dimensional datasets with applications to neuroscience [O] . Emily L Mackevicius, Andrew H Bahle, Alex H Williams, -1

机译：高维数据集中时间序列的无监督发现及其在神经科学中的应用
7. A Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-dimensional Visual Data [O] . Lingqiao Liu, Lei Wang 2015

机译：一种可扩展的无监督特征合并方法，有效降低高维视觉数据的维数

Unsupervised Dimensionality Reduction for High-Dimensional Data Classification

摘要

著录项

相似文献

相关主题

期刊订阅