...
首页> 外文期刊>The American statistician >Rating Movies and Rating the Raters Who Rate Them
【24h】

Rating Movies and Rating the Raters Who Rate Them

机译:对电影进行评级并对对其进行评级的评级者评级

获取原文
获取原文并翻译 | 示例
           

摘要

The movie distribution company Netflix has generated considerable buzz in the statistics community by offering a million dollar prize for improvements to its movie rating system. Among the statisticians and computer scientists who have disclosed their techniques, the emphasis has been on machine learning approaches. This article has the modest goal of discussing a simple model for movie rating and other forms of democratic rating. Because the model involves a large number of parameters, it is nontrivial to carry out maximum likelihood estimation. Here we derive a straightforward EM algorithm from the perspective of the more general MM algorithm. The algorithm is capable of finding the global maximum on a likelihood landscape littered with inferior modes. We apply two variants of the model to a dataset from the MovieLens archive and compare their results. Our model identifies quirky raters, redefines the raw rankings, and permits imputation of missing ratings. The model is intended to stimulate discussion and development of better theory rather than to win the prize. It has the added benefit of introducing readers to some of the issues connected with analyzing high-dimensional data.
机译:电影发行公司Netflix通过提供一百万美元的奖金来改善其电影分级系统,在统计界引起了轩然大波。在公开他们的技术的统计学家和计算机科学家中,重点一直放在机器学习方法上。本文的目标是讨论电影分级和其他形式的民主分级的简单模型。由于该模型涉及大量参数,因此进行最大似然估计并非易事。在这里,我们从更通用的MM算法的角度派生出一种简单的EM算法。该算法能够在乱七八糟的低等模式下找到全局最大值。我们将模型的两个变体应用于MovieLens存档中的数据集,并比较它们的结果。我们的模型可以识别古怪的评分者,重新定义原始排名,并允许估算缺失的评分。该模型旨在激发更好的理论的讨论和发展,而不是赢得大奖。它具有向读者介绍与分析高维数据有关的一些问题的附加好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号