2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference

Research on Distributed Parallel Dimensionality Reduction Algorithm Based on PCA Algorithm


Abstract

Principal component analysis (PCA) is a typical dimensionality reduction method: it projects high-dimensional data onto a lower-dimensional space to obtain a low-dimensional data set that retains as much of the characteristics of the original data set as possible. PCA reduces the dimensionality of high-dimensional data effectively and is widely used in many fields. To address PCA's tedious calculation process and the long time it takes to process massive stream data, this paper proposes DP-PCA, a distributed parallel dimensionality reduction algorithm obtained by improving PCA. Building on the theory of PCA, DP-PCA makes three improvements. First, the original data set is preprocessed using the “mean” method. Second, the procedure for solving the correlation coefficient matrix is improved. Third, a distributed parallel dimensionality reduction scheme is designed for DP-PCA. In addition, DP-PCA is deployed on the Storm platform to parallelize the algorithm, and the deployed algorithm is tested. Experiments show that DP-PCA improves computational efficiency, shortens the dimensionality reduction time, and improves the speedup ratio.
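The abstract does not spell out how DP-PCA computes the correlation coefficient matrix or how the work is split across Storm workers, so the following is only a minimal sketch of one common way to parallelize that step, not the authors' method. It assumes each partition of the data can be summarized by mergeable statistics (row count, column sums, sums of products), that centering on the column means stands in for the “mean” preprocessing, and that every column has nonzero variance. All function names (`partial_stats`, `merge`, `correlation_matrix`, `leading_component`, `distributed_correlation`) are hypothetical, and a local thread pool stands in for distributed workers.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def partial_stats(rows):
    """Summarize one partition: row count, column sums, sums of products."""
    d = len(rows[0])
    n = 0
    s = [0.0] * d
    ss = [[0.0] * d for _ in range(d)]
    for x in rows:
        n += 1
        for i in range(d):
            s[i] += x[i]
            for j in range(d):
                ss[i][j] += x[i] * x[j]
    return n, s, ss


def merge(a, b):
    """Combine the statistics of two partitions by element-wise addition."""
    n = a[0] + b[0]
    s = [u + v for u, v in zip(a[1], b[1])]
    ss = [[u + v for u, v in zip(ra, rb)] for ra, rb in zip(a[2], b[2])]
    return n, s, ss


def correlation_matrix(stats):
    """Build the correlation matrix from merged statistics.

    Assumes every column has nonzero variance.
    """
    n, s, ss = stats
    d = len(s)
    mean = [v / n for v in s]
    # Population covariance: E[x_i x_j] - E[x_i] E[x_j].
    cov = [[ss[i][j] / n - mean[i] * mean[j] for j in range(d)]
           for i in range(d)]
    std = [math.sqrt(cov[i][i]) for i in range(d)]
    return [[cov[i][j] / (std[i] * std[j]) for j in range(d)]
            for i in range(d)]


def leading_component(matrix, iterations=200):
    """Approximate the leading eigenvector by power iteration.

    The start vector must not be orthogonal to the leading eigenvector;
    an asymmetric start makes that unlikely but does not rule it out.
    """
    d = len(matrix)
    v = [1.0 + i for i in range(d)]
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(c * c for c in w))
        v = [c / norm for c in w]
    return v


def distributed_correlation(partitions):
    """Summarize partitions in parallel, then merge the summaries."""
    with ThreadPoolExecutor() as pool:
        parts = list(pool.map(partial_stats, partitions))
    total = parts[0]
    for p in parts[1:]:
        total = merge(total, p)
    return correlation_matrix(total)
```

Because the per-partition statistics combine by simple addition, the merged result matches what a single pass over the whole data set would give, up to floating-point rounding. The sums-of-products formula used here can lose precision when column means are large relative to the spread of the data, so a production version would likely need a more numerically stable update.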
