Efficient Rank Aggregation Using Partial Data

机译：使用部分数据进行有效的排名汇总

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The need to rank items based on user input arises in many practical applications such as elections, group decision making and recommendation systems. The primary challenge in such scenarios is to decide on a global ranking based on partial preferences provided by users. The standard approach to address this challenge is to ask users to provide explicit-numerical ratings (cardinal information) of a subset of the items. The main appeal of such an approach is the ease of aggregation. However, the rating scale as well as the individual ratings are often arbitrary and may not be consistent from one user to another. A more natural alternative to numerical ratings requires users to compare pairs of items (ordinal information). On the one hand, such comparisons provide an "absolute" indicator of the user's preference. On the other hand, it is often hard to combine or aggregate these comparisons to obtain a consistent global ranking. In this work, we provide a tractable framework for utilizing comparison data as well as first-order marginal information (see Section 2) for the purpose of ranking. We treat the available information as partial samples from an unknown distribution over permutations. We then reduce ranking problems of interest to performing inference on this distribution. Specifically, we consider the problems of (a) finding an aggregate ranking of n items, (b) learning the mode of the distribution, and (c) identifying the top k items. For many of these problems, we provide efficient algorithms to infer the ranking directly from the data without the need to estimate the underlying distribution. In other cases, we use the Principle of Maximum Entropy to devise a concise parameterization of a distribution consistent with observations using only O(n~2) parameters, where n is the number of items in question. We propose a distributed, iterative algorithm for estimating the parameters of the distribution. We establish the correctness of the algorithm and identify its rate of convergence explicitly.

机译：在许多实际应用中，例如选举，小组决策和推荐系统中，都需要根据用户输入对项目进行排名。在这种情况下的主要挑战是基于用户提供的部分偏好来决定全局排名。解决此挑战的标准方法是要求用户提供项目子集的显式数字等级（基本信息）。这种方法的主要吸引力在于易于聚合。但是，评分等级以及各个评分通常是任意的，并且在一个用户与另一个用户之间可能不一致。数字评级的更自然替代方法要求用户比较项目对（常规信息）。一方面，这样的比较提供了用户偏好的“绝对”指示。另一方面，通常很难合并或汇总这些比较以获得一致的全球排名。在这项工作中，我们提供了一个易于处理的框架，用于利用比较数据以及一阶边际信息（请参阅第2节）进行排名。我们将可用信息视为来自排列的未知分布的部分样本。然后，我们减少了在此分布上进行推理所需的排名问题。具体来说，我们考虑以下问题：（a）找到n个项目的总排名；（b）学习分布方式；（c）确定排名前k个项目。对于许多此类问题，我们提供了有效的算法，可直接从数据中推断排名，而无需估计基础分布。在其他情况下，我们使用最大熵原理设计一个简明的分布参数化，该分布与仅使用O（n〜2）参数的观测值一致，其中n是所讨论的项数。我们提出了一种分布式的迭代算法来估计分布的参数。我们确定算法的正确性，并明确确定其收敛速度。

著录项

来源
《Proceedings of the ACM SIGMETRICS/PERFORMANCE joint international conference on measurement and modeling of computer systems》|2012年|355-366|共12页
会议地点 London(GB)
作者
Ammar Ammar; Devavrat Shah;
展开▼
作者单位

Massachusetts Institute of Technology 32 Vassar St. Cambridge, USA;

Massachusetts Institute of Technology 32 Vassar St. Cambridge, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Hanking; Aggregation; Maximum Entropy;

机译：汉王聚合;最大熵;

相似文献

外文文献
中文文献
专利

1. Efficient Rank Aggregation Using Partial Data [J] . Ammar Ammar, Devavrat Shah Performance evaluation review . 2012,第1期

机译：使用部分数据进行有效的排名汇总
2. Data-driven Rank Breaking for Efficient Rank Aggregation [J] . Ashish Khetan, Sewoong Oh Journal of machine learning research . 2016,第193期

机译：数据驱动的排名突破，实现高效排名汇总
3. Data-driven Rank Breaking for Efficient Rank Aggregation [J] . Ashish Khetan, Sewoong Oh Journal of machine learning research . 2016,第193期

机译：数据驱动的排名突破，实现高效排名汇总
4. Efficient Rank Aggregation Using Partial Data [C] . Ammar Ammar, Devavrat Shah ACM SIGMETRICS/PERFORMANCE joint international conference on measurement and modeling of computer systems . 2012

机译：使用部分数据有效等级聚合
5. Picking the Best - A Numerical Study Using Balanced Incomplete Block Designs and Partial Rank Aggregation with Noisy Data [D] . Saenz, Ana Isabel 2019

机译：选择最佳-使用平衡的不完整块设计和带噪声数据的部分秩聚合进行的数值研究
6. A comparative study of rank aggregation methods for partial and top ranked lists in genomic applications [O] . Xue Li, Xinlei Wang, Guanghua Xiao -1

机译：基因组应用中部分和最高排名列表的排名聚合方法的比较研究
7. A Bayesian latent variable approach to aggregation of partial and top-ranked lists in genomic studies [O] . Xue Li, Pankaj Kumar Choudhary, Swati Biswas, 2018

机译：基因组研究中部分和排名列表聚集的贝叶斯潜在的变量方法
8. Sync-rank: Robust Ranking, Constrained Ranking and Rank Aggregation via Eigenvector and SDP Synchronization. [R] . Cucuringu, M. 2015

机译：同步排名：通过特征向量和sDp同步的稳健排名，约束排名和排名聚合。

Efficient Rank Aggregation Using Partial Data

摘要

著录项

相似文献

相关主题

期刊订阅