【24h】

The Curse of Dimensionality in Data Mining and Time Series Prediction

机译:数据挖掘和时间序列预测中的维度诅咒

获取原文
获取原文并翻译 | 示例

摘要

Modern data analysis tools have to work on high-dimensional data, whose components are not independently distributed. High-dimensional spaces show surprising, counter-intuitive geometrical properties that have a large influence on the performances of data analysis tools. Among these properties, the concentration of the norm phenomenon results in the fact that Euclidean norms and Gaussian kernels, both commonly used in models, become inappropriate in high-dimensional spaces. This papers presents alternative distance measures and kernels, together with geometrical methods to decrease the dimension of the space. The methodology is applied to a typical time series prediction example.
机译:现代数据分析工具必须处理高维度数据,这些数据的组成不是独立分布的。高维空间显示出令人惊讶的,违反直觉的几何特性,这些特性对数据分析工具的性能有很大的影响。在这些特性中,规范现象的集中导致以下事实:模型中常用的欧几里得规范和高斯核在高维空间中变得不合适。本文介绍了可选的距离度量和核,以及减小空间尺寸的几何方法。该方法应用于典型的时间序列预测示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号