Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model


Abstract

We consider the setup of nonparametric blind regression for estimating the entries of a large $m \times n$ matrix, when provided with a small, random fraction of noisy measurements. We assume that all rows $u \in [m]$ and columns $i \in [n]$ of the matrix are associated with latent features $x_{\text{row}}(u)$ and $x_{\text{col}}(i)$ respectively, and the $(u, i)$-th entry of the matrix, $A(u, i)$, is equal to $f(x_{\text{row}}(u), x_{\text{col}}(i))$ for a latent function $f$. Given noisy observations of a small, random subset of the matrix entries, our goal is to estimate the unobserved entries of the matrix as well as to "de-noise" the observed entries. As the main result of this work, we introduce a nearest-neighbor-based estimation algorithm, and establish its consistency when the underlying latent function $f$ is Lipschitz, the underlying latent space is a bounded diameter Polish space, and the random fraction of observed entries in the matrix is at least $\max\big(m^{-1+\delta}, n^{-1/2+\delta}\big)$, for any $\delta > 0$. As an important byproduct, our analysis sheds light on the performance of the classical collaborative filtering algorithm for matrix completion, which has been widely utilized in practice. Experiments with the MovieLens and Netflix datasets suggest that our algorithm provides a principled improvement over basic collaborative filtering and is competitive with matrix factorization methods. Our algorithm has a natural extension to the setting of tensor completion via flattening the tensor to a matrix. When applied to the setting of image in-painting, where the image is a third-order tensor, we find that our approach is competitive with state-of-the-art tensor completion algorithms across benchmark images.
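To make the nearest-neighbor idea in the abstract concrete, here is a minimal sketch of a generic row-based (user-user) nearest-neighbor estimator. It is not the paper's exact algorithm: the function name `nn_estimate`, the mean squared difference used as the row distance, and the `radius` and `min_overlap` parameters are all assumptions made for illustration.

```python
import numpy as np


def nn_estimate(Y, mask, u, i, radius=0.5, min_overlap=2):
    """Estimate entry (u, i) by averaging column-i observations of rows near row u.

    Y: (m, n) array of noisy observations; values where mask is False are ignored.
    mask: (m, n) boolean array, True where an entry was observed.
    radius, min_overlap: illustrative tuning parameters (assumptions).
    Returns nan if no row qualifies as a neighbor.
    """
    m = Y.shape[0]
    neighbor_values = []
    for v in range(m):
        # A neighbor must be a different row with an observation in column i.
        if v == u or not mask[v, i]:
            continue
        # Columns observed in both rows, excluding the target column.
        overlap = mask[u] & mask[v]
        overlap[i] = False
        if overlap.sum() < min_overlap:
            continue
        # Row distance: mean squared difference over the shared columns.
        dist = np.mean((Y[u, overlap] - Y[v, overlap]) ** 2)
        if dist <= radius:
            neighbor_values.append(Y[v, i])
    return float(np.mean(neighbor_values)) if neighbor_values else float("nan")


# Toy example: row 1 matches row 0 on the shared columns, row 2 does not.
Y = np.array([[1.0, 2.0, 0.0], [1.0, 2.0, 3.0], [5.0, 5.0, 9.0]])
mask = np.array([[True, True, False], [True, True, True], [True, True, True]])
print(nn_estimate(Y, mask, u=0, i=2))  # 3.0
```

In the toy example only row 1 lies within the radius of row 0, so the missing entry is filled with row 1's observation in column 2. Requiring a minimum overlap avoids trusting distances computed from too few shared columns, which matters when observations are sparse.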


Bibliographic record

Source
IEEE Transactions on Information Theory, 2020, Issue 3, pp. 1760-1784 (25 pages)
Author affiliations

MIT Dept Elect Engn & Comp Sci Lab Informat & Decis Syst Cambridge MA 02139 USA|Google Mountain View CA 94043 USA;

MIT Stat & Data Sci Ctr 77 Massachusetts Ave Cambridge MA 02139 USA;

MIT Dept Elect Engn & Comp Sci Lab Informat & Decis Syst Cambridge MA 02139 USA|MIT Dept Elect Engn & Comp Sci Cambridge MA 02139 USA;

MIT Dept Elect Engn & Comp Sci Lab Informat & Decis Syst Cambridge MA 02139 USA|Cornell Univ Sch Operat Res & Informat Engn Ithaca NY 14853 USA;

Original format: PDF
Language: English
Keywords
Collaboration; Matrix decomposition; Estimation; Prediction algorithms; Approximation algorithms; Noise measurement; Blind regression; matrix estimation; matrix completion; tensor estimation; tensor completion; latent variable model; collaborative filtering; nearest neighbor methods;


Similar literature

1. An Experiment to Model Spatial Diffusion Process with Nearest Neighbor Analysis and Regression Estimation [J]. Jay Lee, Jinn-Guey Lay, Wei Chien Benny Chin. International Journal of Applied Geospatial Research. 2014, Issue 1
2. Estimation of a Latent Variable Regression Growth Curve Model for Individuals Cross-Classified by Clusters [J]. Audrey J. Leroux, S. Natasha Beretvas. Multivariate Behavioral Research. 2018, Issue 2
3. Semiparametric latent variable model estimation with endogenous or mismeasured regressors [J]. Arthur Lewbel. Econometrica. 1998, Issue 1
4. Blind Regression: Nonparametric Regression for Latent Variable Models via Collaborative Filtering [C]. Christina E. Lee, Yihua Li, Devavrat Shah. Annual Conference on Neural Information Processing Systems. 2016
5. Comparison of the utility of regression analysis and k-nearest neighbor technique to estimate above-ground biomass in pine forests using Landsat ETM+ imagery [D]. Chitra L. Prabhu. 2006
6. Remaining Useful Life Estimation of Insulated Gate Biploar Transistors (IGBTs) Based on a Novel Volterra k-Nearest Neighbor Optimally Pruned Extreme Learning Machine (VKOPP) Model Using Degradation Data [O]. Zhen Liu, Wenjuan Mei, Xianping Zeng. 2017
7. The Multivariate k-Nearest Neighbor Model for Dependent Variables: One-Sided Estimation and Forecasting [O]. Dominique Guegan, Patrick Rakotomarolahy. 2009
