首页> 外文期刊>Journal of Computational Physics >Iterative algorithms for the post-processing of high-dimensional data
【24h】

Iterative algorithms for the post-processing of high-dimensional data

机译:高维数据后处理的迭代算法

获取原文
获取原文并翻译 | 示例
           

摘要

Scientific computations or measurements may result in huge volumes of high-dimensional data, for instance 10(20) or 100(300) elements. Often these can be thought of representing a real-valued function on a high-dimensional domain. In this and also in most other cases the data can be conceptually arranged in the format of a tensor of high degree, and stored in some truncated or lossy compressed format. We look at some common post-processing tasks which are too time and storage consuming in the uncompressed data format and not obvious in the compressed format, as such huge data sets can not be stored in their entirety, and the value of an element is not readily accessible through simple look-up. The tasks we consider are finding the location of maximum or minimum, or finding all elements in some interval - i.e. level sets, or the number of elements with a value in such a level set, the probability of an element being in a particular level set, and the mean and variance of the total collection. The algorithms to be described are fixed point iterations of particular point-wise functions of the data, which will then exhibit the desired result. To formulate these algorithms, the data is considered as an element of a commutative algebra, and in an abstract sense, the algorithms, which only use these algebraic operations, are independent of the representation. We allow the actual computational representation to be a lossy compression, and we allow the algebra operations to be performed in an approximate fashion, so as to maintain a high compression level. One such example format which is addressed explicitly and described in some detail is the representation of the data as a tensor with compression in the form of a low-rank representation. (C) 2020 Elsevier Inc. All rights reserved.
机译:科学计算或测量可能导致大量的高维数据,例如10(20)或100(300)元件。通常可以认为这些可以在高维域中表示实际值函数。在此以及在大多数其他情况下,数据可以概念性地以高度的张量的格式概念性地布置,并以截断或有损的压缩格式存储。我们查看一些常见的后处理任务,这是不压缩的数据格式的时间和储存,并且压缩格式不明显,因为这种巨大的数据集不能完全存储,并且元素的值不是通过简单的查找,可以随时访问。我们考虑的任务正在找到最大或最小的位置,或在某些间隔 - 即级别集中找到所有元素,或在这种级别集中的值的元素数,元素在特定级别集中的概率,总收集的平均值和方差。要描述的算法是数据的特定点明智功能的固定点迭代,然后将表现出所需的结果。为了制定这些算法,数据被认为是换向代数的元素,并且在抽象意义上,仅使用这些代数操作的算法与表示无关。我们允许实际计算表示成为有损压缩,并且我们允许以近似的方式执行代数操作,以便保持高压缩水平。在一些细节中明确地解决的一种这样的示例格式是作为具有低秩表示形式的压缩的数据的数据的表示。 (c)2020 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号