首页> 外文期刊>The international arab journal of information technology >Comparison of Dimension Reduction Techniques on High Dimensional Datasets
【24h】

Comparison of Dimension Reduction Techniques on High Dimensional Datasets

机译:高维数据集的降维技术比较

获取原文
获取原文并翻译 | 示例
       

摘要

High dimensional data becomes very common with the rapid growth of data that has been stored in databases or other information areas. Thus clustering process became an urgent problem. The well-known clustering algorithms are not adequate for the high dimensional space because of the problem that is called curse of dimensionality. So dimensionality reduction techniques have been used for accurate clustering results and improve the clustering time in high dimensional space. In this work different dimensionality reduction techniques were combined with Fuzzy C-Means clustering algorithm. It is aimed to reduce the complexity of high dimensional datasets and to generate more accurate clustering results. The results were compared in terms of cluster purity, cluster entropy and mutual info. Dimension reduction techniques are compared with current Central Processing Unit (CPU), current memory and elapsed CPU time. The experiments showed that the proposed work produces promising results on high dimensional space.
机译:随着存储在数据库或其他信息区域中的数据的快速增长,高维数据变得非常普遍。因此,集群化过程成为迫在眉睫的问题。由于称为维数诅咒的问题,众所周知的聚类算法不适用于高维空间。因此,降维技术已用于获得准确的聚类结果,并改善了高维空间中的聚类时间。在这项工作中,不同的降维技术与模糊C均值聚类算法相结合。目的是降低高维数据集的复杂度并生成更准确的聚类结果。对结果进行了簇纯度,簇熵和互信息的比较。将降维技术与当前的中央处理器(CPU),当前的内存和已用的CPU时间进行了比较。实验表明,提出的工作在高维空间上产生了可喜的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号