首页> 外文期刊>SIAM Journal on Matrix Analysis and Applications >TENSOR-CUR DECOMPOSITIONS FOR TENSOR-BASED DATA
【24h】

TENSOR-CUR DECOMPOSITIONS FOR TENSOR-BASED DATA

机译:基于Tensor的数据的Tensor-Cur分解

获取原文
获取原文并翻译 | 示例
           

摘要

Motivated by numerous applications in which the data may be modeled by a variable subscripted by three or more indices, we develop a tensor-based extension of the matrix CUR decomposition. The tensor-CUR decomposition is most relevant as a data analysis tool when the data consist of one mode that is qualitatively different from the others. In this case, the tensor-CUR decomposition approximately expresses the original data tensor in terms of a basis consisting of underlying subtensors that are actual data elements and thus that have a natural interpretation in terms of the processes generating the data. Assume the data may be modeled as a (2+1)-tensor, i.e., an m x n x p tensor A in which the first two modes are similar and the third is qualitatively different. We refer to each of the p different m x n matrices as "slabs" and each of the mn different p-vectors as "fibers." In this case, the tensor-CUR algorithm computes an approximation to the data tensor A that is of the form CUR, where C is an m x n x c tensor consisting of a small number c of the slabs, 12 is an r x p matrix consisting of a small number r of the fibers, and U is an appropriately defined and easily computed c x r encoding matrix. Both C and R may be chosen by randomly sampling either slabs or fibers according to a judiciously chosen and data-dependent probability distribution, and both c and r depend on a rank parameter k, an error parameter e, and a failure probability 6. Under appropriate assumptions, provable bounds on the Frobenius norm of the error tensor A _ CUR are obtained. In order to demonstrate the general applicability of this tensor decomposition, we apply it to problems in two diverse domains of data analysis: hyperspectral medical image analysis and consumer recommendation system analysis. In the hyperspectral data application, the tensor-CUR decomposition is used to compress the data, and we show that classification quality is not substantially reduced even after substantial data compression. In the recommendation system application, the tensor-CUR decomposition is used to reconstruct missing entries in a user-product-product preference tensor, and we show that high quality recommendations can be made on the basis of a small number of basis users and a small number of product-product comparisons from a new user.
机译:出于众多应用的推动,在这些应用中,数据可能由三个或多个下标的变量建模,我们开发了基于张量的矩阵CUR分解的扩展。当数据包含一种在性质上与另一种不同的模式时,张量-CUR分解与数据分析工具最为相关。在这种情况下,张量-CUR分解根据包括作为实际数据元素的基础子张量组成的基础近似表示原始数据张量,因此就生成数据的过程而言具有自然的解释。假设可以将数据建模为(2 + 1)张量,即m x n x p张量A,其中前两种模式相似而第三种在质量上不同。我们将p个不同的m x n个矩阵中的每一个称为“平板”,将mn个不同的p向量中的每一个称为“纤维”。在这种情况下,张量-CUR算法计算数据张量A的近似值,其形式为CUR,其中C是由少量c平板组成的mxnxc张量,12是由少量c组成的rxp矩阵光纤的r,U是适当定义和易于计算的cxr编码矩阵。 C和R均可通过根据明智选择且与数据相关的概率分布对板或纤维进行随机采样来选择,而c和r均取决于等级参数k,误差参数e和失效概率6。通过适当的假设,可以得出误差张量A _ CUR的Frobenius范数的可证明范围。为了证明此张量分解的一般适用性,我们将其应用于数据分析的两个不同领域中的问题:高光谱医学图像分析和消费者推荐系统分析。在高光谱数据应用中,张量-CUR分解用于压缩数据,并且我们显示,即使经过大量数据压缩,分类质量也不会显着降低。在推荐系统应用程序中,张量-CUR分解用于重构用户-产品-产品偏好张量中的缺失条目,并且我们证明了可以在少量基础用户和少量基础上做出高质量推荐新用户进行产品与产品比较的次数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号