...
首页> 外文期刊>ACM transactions on knowledge discovery from data >Context-Based Evaluation of Dimensionality Reduction Algorithms-Experiments and Statistical Significance Analysis
【24h】

Context-Based Evaluation of Dimensionality Reduction Algorithms-Experiments and Statistical Significance Analysis

机译:基于背景的维度降低算法评估 - 实验和统计显着性分析

获取原文
获取原文并翻译 | 示例

摘要

Dimensionality reduction is a commonly used technique in data analytics. Reducing the dimensionality of datasets helps not only with managing their analytical complexity but also with removing redundancy. Over the years, several such algorithms have been proposed with their aims ranging from generating simple linear projections to complex non-linear transformations of the input data. Subsequently, researchers have defined several quality metrics in order to evaluate the performances of different algorithms. Hence, given a plethora of dimensionality reduction algorithms and metrics for their quality analysis, there is a long-existing need for guidelines on how to select the most appropriate algorithm in a given scenario. In order to bridge this gap, in this article, we have compiled 12 state-of-the-art quality metrics and categorized them into 5 identified analytical contexts. Furthermore, we assessed 15 most popular dimensionality reduction algorithms on the chosen quality metrics using a large-scale and systematic experimental study. Later, using a set of robust non-parametric statistical tests, we assessed the generalizability of our evaluation on 40 real-world datasets. Finally, based on our results, we present practitioners' guidelines for the selection of an appropriate dimensionally reduction algorithm in the present analytical contexts.
机译:Dimennality Dreaks是数据分析中常用的技术。减少数据集的维度不仅有助于管理分析复杂性,而且有助于删除冗余。多年来,已经提出了几种这样的算法,其目的是从产生简单的线性投影到输入数据的复杂非线性变换。随后,研究人员已经确定了几种质量指标,以评估不同算法的性能。因此,考虑到过多的维数减少算法和质量分析的度量,有一个关于如何在给定场景中选择最合适的算法的指导方针的长期存在。为了弥合这一差距,在本文中,我们编制了12个最先进的质量指标,并将它们分为5个识别的分析背景。此外,我们使用大规模和系统的实验研究评估了所选质量指标的15个最受欢迎的维度减少算法。后来,使用一套强大的非参数统计测试,我们评估了我们对40个现实世界数据集的评估的普遍性。最后,根据我们的结果,我们提出了从业者在本分析背景下选择适当的尺寸减少算法的指导。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号