【24h】

Multi-target Prediction via Low-Rank Embeddings

机译:通过低秩嵌入进行多目标预测

获取原文

摘要

Linear prediction methods, such as linear regression and classification, form the bread-and-butter of modem machine learning. The classical scenario is the presence of data with multiple features and a single target variable. However, there are many recent scenarios where there are multiple target variables. For example, recommender systems, predicting bid words for a web page (where each bid word acts as a target variable), or predicting diseases linked to a gene. In many of these scenarios, the target variables might themselves be associated with features. In these scenarios, bilinear and nonlinear prediction via low-rank embeddings have been shown to be extremely powerful. The low-rank embeddings serve a dual purpose: (i) they enable tractable computation even in the face of millions of data points as well as target variables, and (ii) they exploit correlations among the target variables, even when there are many missing observations. We illustrate our methodology on various modern machine learning problems: recommender systems, multi-label learning and inductive matrix completion, and present results on some standard benchmarks as well as an application that involves prediction of gene-disease associations.
机译:线性预测方法(例如线性回归和分类)形成了现代机器学习的基础。经典方案是存在具有多个功能和单个目标变量的数据。但是,最近有许多方案存在多个目标变量。例如,推荐系统,预测网页的出价词(每个出价词充当目标变量)或预测与基因相关的疾病。在许多情况下,目标变量本身可能与要素相关联。在这些情况下,通过低秩嵌入进行的双线性和非线性预测已显示出非常强大的功能。低秩嵌入具有双重目的:(i)即使面对数以百万计的数据点和目标变量,它们也可实现易于处理的计算;(ii)即使缺少许多目标变量,它们也可利用目标变量之间的相关性观察。我们将说明有关各种现代机器学习问题的方法:推荐系统,多标签学习和归纳矩阵完成,并在一些标准基准以及涉及预测基因疾病关联的应用中显示结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号