【24h】

Top-k preferences in high dimensions

机译:高尺寸的顶级偏乐

获取原文

摘要

Given a set of objects O, each with d numeric attributes, a top-k preference scores these objects using a linear combination of their attribute values, where the weight on each attribute reflects the interest in this attribute. Given a query preference q, a top-k query finds the k objects in O with highest scores with respect to q. Given a query object o and a set of preferences Q, a reverse top-k query finds all preferences q ∈ Q for which o becomes one of the top k objects with respect to q. Previous solutions to these problems are effective only in low dimensions. In this paper, we develop a solution for much higher dimensions (up to high tens), if many preferences exhibit sparsity—i.e., each specifies non-zero weights for only a handful (say 5–7) of attributes (though the subsets of such attributes and their weights can vary greatly). Our idea is to select carefully a set of low-dimensional core subspaces to “cover” the sparse preferences in a workload. These subspaces allow us to index them more effectively than the full-dimensional space. Being multi-dimensional, each subspace covers many possible preferences; furthermore, multiple subspaces can jointly cover a preference, thereby expanding the coverage beyond each subspace's dimensionality. Experimental evaluation validates our solution's effectiveness and advantages over previous solutions.
机译:给定一组对象O,每个对象o与​​d数字属性,top-k首选项使用其属性值的线性组合来分为这些对象,其中每个属性上的权重反映了此属性的兴趣。给定查询偏好Q,顶-K查询在o上找到o的o中的k个对象,相对于q。给定查询对象o和一组首选项q,反向top-k查询查找所有首选项q q q相对于q的顶部K对象之一。这些问题的先前解决方案仅在低维度下有效。在本文中,我们开发一个更高的尺寸(最高的尺寸)的解决方案,如果许多偏好表现出稀疏性 - 即,只要仅针对少数(比如5-7)属性(虽然子集)的非零权重这种属性及其重量可以很大差异)。我们的想法是仔细选择一组低维核心子空间,以“覆盖”工作负载中的稀疏偏好。这些子空间允许我们比全维空间更有效地索引它们。多维,每个子空间都涵盖了许多可能的偏好;此外,多个子空间可以共同覆盖偏好,从而扩展了每个子空间的维度的覆盖范围。实验评估验证我们解决方案的效果和优势对先前的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号