首页> 外文会议>IEEE High Performance Extreme Computing Conference >Multiscale Data Analysis Using Binning, Tensor Decompositions, and Backtracking
【24h】

Multiscale Data Analysis Using Binning, Tensor Decompositions, and Backtracking

机译:多尺度数据分析使用搭档,张量分解和回溯

获取原文

摘要

Large data sets can contain patterns at multiple scales (spatial, temporal, etc.). In practice, it is useful for data exploration techniques to detect patterns at each relevant scale. In this paper, we develop an approach to detect activities at multiple scales using tensor decomposition, an unsupervised high-dimensional data analysis technique that finds correlations between different features in the data. This method typically requires that the feature values are discretized during the construction of the tensor in a process called “binning.” We develop a method of constructing and decomposing tensors with different binning schemes of various features in order to uncover patterns across a set of user-defined scales. While binning is necessary to obtain interpretable results from tensor decompositions, it also decreases the specificity of the data. Thus, we develop backtracking methods that enable the recovery of original source data corresponding to patterns found in the decomposition. These technique are discussed in the context of spatiotemporal and network traffic data, and in particular on Automatic Identification System (AIS) data.
机译:大数据集可以包含多个尺度(空间,时间等)的模式。在实践中,对数据探测技术来检测每个相关规模的模式是有用的。在本文中,我们使用张量分解,开发一种方法来检测多个尺度的活动,这是一种无监视的高维数据分析技术,该技术在数据中发现不同特征之间的相关性。该方法通常要求在一个名为“Binning”的过程中的张量的构建期间离散化特征值。我们开发一种用各种特征的不同分子方案构建和分解张量的方法,以便在一组用户定义的尺度上揭示图案。虽然啤酒是必要的,但是从张量分解中获得可解释结果,但它也会降低数据的特异性。因此,我们开发回溯方法,使得能够恢复对应于分解中的模式的原始源数据。这些技术在时空和网络流量数据的上下文中讨论,特别是在自动识别系统(AIS)数据上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号