...
首页> 外文期刊>Computer Science & Information Technology >Towards Reduction Of Data Flow in A Distributed Network Using Principal Component Analysis
【24h】

Towards Reduction Of Data Flow in A Distributed Network Using Principal Component Analysis

机译:主成分分析法在分布式网络中减少数据流

获取原文
           

摘要

For performing distributed data mining two approaches are possible: First, data from several sources are copied to a data warehouse and mining algorithms are applied in it. Secondly, mining can performed at the local sites and the results can be aggregated. When the number of features is high, a lot of bandwidth is consumed in transferring datasets to a centralized location. For this dimensionality reduction can be done at the local sites. In dimensionality reduction a certain encoding is applied on data so as to obtain its compressed form. The reduced features thus obtained at the local sites are aggregated and data mining algorithms are applied on them. There are several methods of performing dimensionality reduction. Two most important ones are Discrete Wavelet Transforms (DWT) and Principal Component Analysis (PCA). Here a detailed study is done on how PCA could be useful in reducing data flow across a distributed network.
机译:为了执行分布式数据挖掘,可以采用两种方法:首先,将来自多个源的数据复制到数据仓库中,并在其中应用挖掘算法。其次,可以在本地站点进行挖掘,并且可以汇总结果。当特征数量很高时,将数据集传输到集中位置会消耗大量带宽。为此,可以在本地站点进行降维。在降维中,将某种编码应用于数据,以获得其压缩形式。这样在本地站点获得的简化特征被汇总,并在其上应用数据挖掘算法。有几种执行降维的方法。最重要的两个是离散小波变换(DWT)和主成分分析(PCA)。在这里完成了有关PCA如何在减少分布式网络中的数据流方面有用的详细研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号