首页> 外文期刊>IEEE Transactions on Information Theory >Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model
【24h】

Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model

机译:潜在变量模型的矩阵估计的最近邻被解释为盲回归

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

We consider the setup of nonparametric blind regression for estimating the entries of a large m imes n matrix, when provided with a small, random fraction of noisy measurements. We assume that all rows u in [m] and columns i in [n] of the matrix are associated to latent features x_ext row(u) and x_ext col(i) respectively, and the (it u, i) -th entry of the matrix, A(it u, i) is equal to f(x_ext row(u), x_ext col(i)) for a latent function f . Given noisy observations of a small, random subset of the matrix entries, our goal is to estimate the unobserved entries of the matrix as well as to "de-noise" the observed entries. As the main result of this work, we introduce a nearest-neighbor-based estimation algorithm, and establish its consistency when the underlying latent function f is Lipschitz, the underlying latent space is a bounded diameter Polish space, and the random fraction of observed entries in the matrix is at least max ig (m-1 + delta , n-1/2 + delta ig) , for any delta > 0 . As an important byproduct, our analysis sheds light into the performance of the classical collaborative filtering algorithm for matrix completion, which has been widely utilized in practice. Experiments with the MovieLens and Netflix datasets suggest that our algorithm provides a principled improvement over basic collaborative filtering and is competitive with matrix factorization methods. Our algorithm has a natural extension to the setting of tensor completion via flattening the tensor to matrix. When applied to the setting of image in-painting, which is a 3-order tensor, we find that our approach is competitive with respect to state-of-art tensor completion algorithms across benchmark images.
机译:当提供一个小的随机分数的噪声测量值时,我们考虑使用非参数盲回归来估计一个大的m×n矩阵的项。我们假设矩阵的所有行u in [m]和列i in [n]分别与潜在特征x_ text row(u)和x_ text col(i)以及( it u ,i)矩阵的第一个项A( it u,i)等于f(x_ text row(u),x_ text col(i))的潜函数f。给定一个小的随机子集的矩阵项的嘈杂观测值,我们的目标是估计矩阵中未观测到的项以及对观测到的项进行“去噪”。作为这项工作的主要结果,我们引入了基于最近邻的估计算法,并在基础潜在函数f为Lipschitz,基础潜在空间为有界直径波兰空间以及观察到的项的随机分数时建立了其一致性。对于任何 delta> 0,矩阵中的至少是 max big(m-1 + delta,n-1 / 2 + delta big)。作为重要的副产品,我们的分析揭示了经典协同过滤算法用于矩阵完成的性能,该算法在实践中已得到广泛利用。使用MovieLens和Netflix数据集进行的实验表明,我们的算法对基本协作过滤提供了原则上的改进,并且与矩阵分解方法相比具有竞争优势。我们的算法通过将张量展平为矩阵,自然扩展了张量完成的设置。当应用于3阶张量的图像绘制设置时,我们发现我们的方法相对于跨基准图像的最新张量完成算法具有竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号