首页> 外文会议>IEEE Data Science Workshop >Online Tensor Decomposition and Imputation for Count Data
【24h】

Online Tensor Decomposition and Imputation for Count Data

机译:在线张量分解和计数数据的归纳

获取原文
获取外文期刊封面目录资料

摘要

Unveiling low-dimensional latent structure by means of multilinear decompositions of tensor data is central to data analytics tasks at the confluence of signal processing, machine learning and data mining. However, increasingly noisy, incomplete, and heterogeneous datasets (that deviate from e.g., Gaussian distributional assumptions) as well as the need for real-time processing of streaming data pose major challenges to this end. In this context, the present paper develops a novel online (adaptive) algorithm to obtain three-way decompositions of low-rank, Poisson-distributed tensors. Such (possibly incomplete) streams of count data arise with various applications including traffic engineering, computer network monitoring, genomics, photonics and satellite imaging. The proposed estimator minimizes a Poisson log-likelihood cost along with a separable regularizer of the PARAFAC decomposition factors, to trade-off fidelity for complexity of the approximation captured by the decomposition's rank. Leveraging stochastic gradient descent iterations, a scalable, online algorithm is developed to learn the decomposition factors on-the-fly and perform data imputation as a byproduct. Preliminary numerical tests with simulated data and solar flare video confirm the efficacy of the proposed tensor imputation algorithm, as well as its convergence to the batch estimator benchmark.
机译:揭幕低维由张量数据的多线性分解来潜结构是在信号处理,机器学习和数据挖掘的合流中央到数据分析任务。然而,越来越多嘈杂的,不完整的,异构数据集(从例如,高斯分布假设的偏离),以及需要对数据流进行实时处理带来为此重大挑战。在此背景下,本纸张在线开发一种新型的(自适应)算法,以获得低秩,泊松分布张量三通分解。这样(可能不完整)计数与各种应用,包括流量工程,计算机网络监控,基因组学,光电子和卫星成像数据产生的流。所提出的估计泊松数似然成本最小化与PARAFAC分解因素的可分离正则一起,以权衡保真度被分解的排名拍摄的近似的复杂性。利用随机梯度下降的迭代,可扩展,在线算法开发学习上即时分解因素和执行数据归集作为副产品。初步数值试验用模拟数据和太阳耀斑视频确认所提出的张量插补算法的有效性,以及它的收敛到批处理估计基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号