Personalized Prediction and Sparsity Pursuit in Latent Factor Models

Zhu Yunzhang; Shen Xiaotong; Ye Changqing

首页> 外文期刊>Journal of the American statistical association >Personalized Prediction and Sparsity Pursuit in Latent Factor Models

【24h】

Personalized Prediction and Sparsity Pursuit in Latent Factor Models

机译：潜在因子模型中的个性化预测和稀疏性追求

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Personalized information filtering extracts the information specifically relevant to a user, predicting his/her preference over a large number of items, based on the opinions of users who think alike or its content. This problem is cast into the framework of regression and classification, where we integrate additional user-specific and content-specific predictors in partial latent models, for higher predictive accuracy. In particular, we factorize a user-over-item preference matrix into a product of two matrices, each representing a user's preference and an item preference by users. Then we propose a likelihood method to seek a sparsest latent factorization, from a class of overcomplete factorizations, possibly with a high percentage of missing values. This promotes additional sparsity beyond rank reduction. Computationally, we design methods based on a decomposition and combination strategy, to break large-scale optimization into many small subproblems to solve in a recursive and parallel manner. On this basis, we implement the proposed methods through multi-platform shared-memory parallel programming, and through Mahout, a library for scalable machine learning and data mining, for mapReduce computation. For example, our methods are scalable to a dataset consisting of three billions of observations on a single machine with sufficient memory, having good timings. Both theoretical and numerical investigations show that the proposed methods exhibit a significant improvement in accuracy over state-of-the-art scalable methods. Supplementary materials for this article are available online.

机译：个性化信息过滤基于与自己想法相似的用户的意见或其内容，提取与用户特别相关的信息，从而预测他/她对大量商品的偏好。这个问题被放到回归和分类的框架中，在该框架中，我们将其他特定于用户和特定于内容的预测器集成到部分潜在模型中，以实现更高的预测精度。特别是，我们将用户优先项目偏好矩阵分解为两个矩阵的乘积，每个矩阵代表用户的偏好和用户的项目偏好。然后，我们提出了一种似然方法，该方法从一类过度完成的因式分解中寻找最稀疏的潜在因式分解，可能具有较高百分比的缺失值。除了降低等级之外，这还促进了额外的稀疏性。在计算上，我们基于分解和组合策略设计方法，以将大规模优化分解为许多小子问题，以递归和并行的方式进行求解。在此基础上，我们通过多平台共享内存并行编程，并通过Mahout（用于可伸缩机器学习和数据挖掘的库，用于mapReduce计算）来实现所提出的方法。例如，我们的方法可扩展到一个数据集，该数据集由一台机器上的30亿个观测值组成，具有足够的内存并具有良好的时序。理论和数值研究均表明，与最新的可缩放方法相比，所提出的方法在准确性方面有显着提高。可在线获得本文的补充材料。

著录项

来源
《Journal of the American statistical association》 |2016年第513期|241-252|共12页
作者
Zhu Yunzhang; Shen Xiaotong; Ye Changqing;
展开▼
作者单位

Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA;

Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA;

Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Alternating directions; Collaborative filtering; Content-based filtering; Partial latent models; Recommender; Sparse factorization;

机译：交替方向;协作过滤;基于内容的过滤;局部隐模型;推荐;稀疏因子分解;

相似文献

外文文献
中文文献
专利

1. Integrating Topic and Latent Factors for Scalable Personalized Review-based Rating Prediction [J] . Wei Zhang, Jianyong Wang IEEE Transactions on Knowledge and Data Engineering . 2016,第11期

机译：整合主题和潜在因素，以实现可扩展的个性化基于审阅的评分预测
2. A Fast Parallelized Computational Approach Based on Sparse LU Factorization for Predictions of Spatial and Time-Dependent Currents and Voltages in Full-Body Biomodels [J] . Mishra A., Joshi R.P., Schoenbach K.H., IEEE Transactions on Plasma Science . 2006,第4期

机译：基于稀疏LU分解的快速并行计算方法，用于预测人体模型中时空相关的电流和电压
3. Towards integrated clinico-genomic models for personalized medicine: combining gene expression signatures and clinical factors in breast cancer outcomes prediction [J] . Joseph R. Nevins, Erich S. Huang, Holly Dressman, Human Molecular Genetics . 2003,第2期

机译：走向个性化医学的综合临床基因组模型：将基因表达特征和临床因素结合起来可预测乳腺癌的预后
4. Personalized QoS Prediction for Web Services Using Latent Factor Models [C] . Dongjin Yu, Yu Liu, Yueshen Xu, IEEE International Conference on Services Computing . 2014

机译：使用潜在因子模型的Web服务个性化QoS预测
5. Feature Selection and Personalized Modeling on Medical Adverse Outcome Prediction [D] . Dai, Qingqing. 2020

机译：医学不利结果预测的特征选择和个性化建模
6. Advanced Dietary Patterns Analysis Using Sparse Latent Factor Models in Young Adults [O] . Jaehyun Joo, Sinead A Williamson, Ana I Vazquez, -1

机译：青年人使用稀疏潜能模型的高级饮食模式分析
7. Latent Factor Prediction Pursuit for Rank Deficient Regressors [O] . Czogiel I., Luebke K., Weihs C. 2004

机译：秩不足的回归变量的潜在因子预测追踪
8. Fast, Parallelized Computational Approach Based on Sparse LU Factorization, for Predictions of Spatial and Time-Dependent Currents and Voltages in Full-Body Bio-Models [R] . Mishra, A. , Joshi, R. P. , Schoenbach, K. H. , 2006

机译：基于稀疏LU分解的快速并行计算方法，用于全身生物模型中空间和时间依赖电流和电压的预测

Personalized Prediction and Sparsity Pursuit in Latent Factor Models

摘要

著录项

相似文献

相关主题

期刊订阅