首页> 外文会议>Software and cyberinfrastructure for astronomy conference >Shannon sampling and nonlinear dynamics on graphs for representation, regularization and visualization of complex data
【24h】

Shannon sampling and nonlinear dynamics on graphs for representation, regularization and visualization of complex data

机译:图上的Shannon采样和非线性动力学,用于表示,正则化和可视化复杂数据

获取原文

摘要

Data is now produced faster than it can be meaningfully analyzed. Many modern data sets present unprecedented analytical challenges, not merely because of their size but by their inherent complexity and information richness. Large numbers of astronomical objects now have dozens or hundreds of useful parameters describing each one. Traditional color-color plots using a limited number of symbols and some color-coding are clearly inadequate for finding all useful correlations given such large numbers of parameters. To capitalize on the opportunities provided by these data sets one needs to be able to organize, analyze and visualize them in fundamentally new ways. The identification and extraction of useful information in multiparametric, high-dimensional data sets - data mining - is greatly facilitated by finding simpler, that is, lower-dimensional abstract mathematical representations of the data sets that are more amenable to analysis. Dimensionality reduction consists of finding a lower-dimensional representation of high-dimensional data by constructing a set of basis functions that capture patterns intrinsic to a particular state space. Traditional methods of dimension reduction and pattern recognition often fail to work well when performed upon data sets as complex as those that now confront astronomy. We present here our developments of data compression, sampling, nonlinear dimensionality reduction, and clustering, which are important steps in the analysis of large-scale, complex datasets.
机译:现在,生成数据的速度超过了有意义的分析速度。许多现代数据集提出了前所未有的分析挑战,这不仅是因为它们的大小,还因为它们固有的复杂性和信息丰富性。现在,大量的天文物体具有数十个或数百个有用的参数来描述每个物体。使用有限数量的符号和一些颜色编码的传统彩色图表显然不足以在给定如此大量参数的情况下找到所有有用的相关性。为了利用这些数据集提供的机会,需要能够以根本上新的方式组织,分析和可视化它们。通过找到更易于分析的更简单的数据集(即数据集的低维抽象数学表示形式),极大地促进了多参数,高维数据集中有用信息的识别和提取(数据挖掘)。降维包括通过构造一组捕获特定状态空间固有模式的基函数来找到高维数据的低维表示。传统的降维和模式识别方法通常无法在像现在面临的天文学那样复杂的数据集上执行。在这里,我们介绍数据压缩,采样,非线性降维和聚类的发展,这是分析大型复杂数据集的重要步骤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号