首页> 外文期刊>International Journal of Applied Mathematics and Computer Science >AN ALGORITHM FOR REDUCING THE DIMENSION AND SIZE OF A SAMPLE FOR DATA EXPLORATION PROCEDURES
【24h】

AN ALGORITHM FOR REDUCING THE DIMENSION AND SIZE OF A SAMPLE FOR DATA EXPLORATION PROCEDURES

机译:减少数据探查程序样本的尺寸和大小的算法

获取原文
获取原文并翻译 | 示例
           

摘要

The paper deals with the issue of reducing the dimension and size of a data set (random sample) for exploratory data analysis procedures. The concept of the algorithm investigated here is based on linear transformation to a space of a smaller dimension, while retaining as much as possible the same distances between particular elements. Elements of the transformation matrix are computed using the metaheuristics of parallel fast simulated annealing. Moreover, elimination of or a decrease in importance is performed on those data set elements which have undergone a significant change in location in relation to the others. The presented method can have universal application in a wide range of data exploration problems, offering flexible customization, possibility of use in a dynamic data environment, and comparable or better performance with regards to the principal component analysis. Its positive features were verified in detail for the domain's fundamental tasks of clustering, classification and detection of atypical elements (outliers).
机译:本文探讨了减少探索性数据分析程序的数据集(随机样本)的尺寸和大小的问题。此处研究的算法的概念基于对较小尺寸空间的线性变换,同时在特定元素之间尽可能保留相同的距离。转换矩阵的元素是使用并行快速模拟退火的元启发法计算的。此外,对那些位置相对于其他位置发生了重大变化的数据集元素执行消除或降低重要性的操作。所提出的方法可以在广泛的数据探索问题中得到普遍应用,提供灵活的自定义,在动态数据环境中使用的可能性以及就主成分分析而言可比或更好的性能。对于该域的非典型元素聚类,分类和检测(离群值)的基本任务,已对其详细特征进行了验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号