【24h】

Imputation of streaming low-rank tensor data

机译:流式低秩张量数据的插补

获取原文

摘要

Unraveling latent structure by means of multilinear models of tensor data is of paramount importance in timely inference tasks encountered with ‘Big Data’ analytics. However, increasingly noisy, heterogeneous, and incomplete datasets as well as the need for real-time processing of streaming data pose major challenges to this end. The present paper introduces a novel online (adaptive) algorithm to decompose low-rank tensors with missing entries, and perform imputation as a byproduct. The novel estimator minimizes an exponentially-weighted least-squares fitting error along with a separable regularizer of the PARAFAC decomposition factors, to trade-off fidelity for complexity of the approximation captured by the decomposition's rank. Leveraging stochastic gradient descent iterations, a scalable, real-time algorithm is developed and its convergence is established under simplifying technical assumptions. Simulated tests with cardiac magnetic resonance imagery (MRI) data confirm the efficacy of the proposed algorithm in imputing up to 75% missing entries.
机译:在“大数据”分析遇到的及时推理任务中,借助张量数据的多线性模型来揭示潜在结构至关重要。然而,越来越多的噪声,异构和不完整的数据集以及对流数据的实时处理的需求为此提出了重大挑战。本文介绍了一种新颖的在线(自适应)算法,该算法可分解缺少条目的低秩张量,并将插补作为副产品执行。新颖的估算器可将指数加权的最小二乘拟合误差与PARAFAC分解因子的可分正则化器一起最小化,以权衡保真度,以利于分解等级所捕获的近似值的复杂性。利用随机梯度下降迭代,开发了一种可扩展的实时算法,并在简化技术假设的前提下建立了其收敛性。使用心脏磁共振成像(MRI)数据进行的模拟测试证实了该算法在估算高达75%的缺失条目中的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号