Journal: Information Retrieval

Statistical biases in Information Retrieval metrics for recommender systems



Abstract

There is an increasing consensus in the Recommender Systems community that the dominant error-based evaluation metrics are insufficient, and mostly inadequate, to properly assess the practical effectiveness of recommendations. Seeking to evaluate recommendation rankings (which largely determine the effective accuracy in matching user needs) rather than predicted rating values, Information Retrieval metrics have started to be applied for the evaluation of recommender systems. In this paper we analyse the main issues and potential divergences in the application of Information Retrieval methodologies to recommender system evaluation, and provide a systematic characterisation of experimental design alternatives for this adaptation. We lay out an experimental configuration framework upon which we identify and analyse specific statistical biases arising in the adaptation of Information Retrieval metrics to recommendation tasks, namely sparsity and popularity biases. These biases considerably distort the empirical measurements, hindering the interpretation and comparison of results across experiments. We develop a formal characterisation and analysis of the biases upon which we analyse their causes and main factors, as well as their impact on evaluation metrics under different experimental configurations, illustrating the theoretical findings with empirical evidence. We propose two experimental design approaches that effectively neutralise such biases to a large extent. We report experiments validating our proposed experimental variants, and comparing them to alternative approaches and metrics that have been defined in the literature with similar or related purposes.
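The sparsity bias the abstract refers to can be illustrated with a small simulation that is not taken from the paper itself: a minimal sketch, assuming a hypothetical catalogue, user, and uninformative recommender. It measures precision@10 for the same system under two candidate-selection configurations, one where only the user's rated (judged) test items are ranked, and one where the full catalogue is ranked and unjudged items are counted as non-relevant. The measured score changes drastically even though the system is identical, which is the kind of distortion the experimental framework described above is designed to characterise.

```python
import random

rng = random.Random(0)

def precision_at_k(ranking, relevant, k=10):
    """Fraction of the top-k ranked candidates that are known-relevant."""
    return sum(1 for item in ranking[:k] if item in relevant) / k

# Hypothetical setup: a 1000-item catalogue; for one user, 50 items have
# test ratings and 20 of those count as relevant (e.g. rated above a threshold).
items = list(range(1000))
rated = rng.sample(items, 50)
relevant = set(rng.sample(rated, 20))

def random_ranking(candidates):
    """The same (uninformative) recommender, used in both configurations."""
    ranking = list(candidates)
    rng.shuffle(ranking)
    return ranking

trials = 500

# Configuration (a): candidates = rated items only (dense judgements).
p_dense = sum(precision_at_k(random_ranking(rated), relevant)
              for _ in range(trials)) / trials

# Configuration (b): candidates = full catalogue; unrated items are
# unjudged, but the metric must treat them as non-relevant (sparse judgements).
p_sparse = sum(precision_at_k(random_ranking(items), relevant)
               for _ in range(trials)) / trials

print(round(p_dense, 3), round(p_sparse, 3))
```

With the dense configuration the expected precision is roughly the relevance ratio among rated items (20/50 = 0.4), while in the sparse configuration it collapses towards 20/1000, even though nothing about the recommender changed; comparisons are only meaningful between runs that share the same candidate-selection design.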
机译:推荐系统社区中越来越多的共识是,基于错误的主要评估指标不足以且不足以正确评估建议的实际有效性。为了评估推荐等级(这在很大程度上决定了满足用户需求的有效准确性),而不是预测的等级值,信息检索指标已开始应用于推荐系统的评估。在本文中,我们分析了信息检索方法论在推荐系统评估中的主要问题和潜在的分歧,并为适应性实验提供了实验设计替代方案的系统表征。我们提出了一个实验性配置框架,在此框架上,我们可以识别和分析在将信息检索指标适应推荐任务时出现的特定统计偏差,即稀疏性和受欢迎度偏差。这些偏差极大地扭曲了经验测量结果,从而阻碍了整个实验结果的解释和比较。我们对偏差进行了正式的表征和分析,以此来分析其成因和主要因素,以及它们在不同实验配置下对评估指标的影响,以经验证据说明理论发现。我们提出了两种实验设计方法,可以在很大程度上有效抵消这种偏差。我们报告了一些实验,这些实验验证了我们提出的实验变体,并将它们与文献中出于相似或相关目的定义的替代方法和指标进行了比较。


