首页> 外文会议>IEEE International Conference on Big Data >Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data sets
【24h】

Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data sets

机译:检测等级中的两极分化:自动建立管道并对几个基准数据集进行初步量化

获取原文

摘要

Personalized recommender systems are becoming increasingly relevant and important in the study of polarization and bias, given their widespread use in filtering information spaces. Polarization is a social phenomenon, with serious consequences, in real-life, particularly on social media. Thus it is important to understand how machine learning algorithms, especially recommender systems, behave in polarized environments. In this paper, we study polarization within the context of the users' interactions with a space of items and how this affects recommender systems. We first formalize the concept of polarization based on item ratings and then relate it to the item reviews to investigate any potential correlation. We then propose a domain independent data science pipeline to automatically detect polarization using the ratings rather than the typical properties used to detect polarization, such as item's content or social network topology. We perform an extensive comparison of polarization measures on several benchmark data sets and show that our polarization detection framework can detect different degrees of polarization and outperforms existing measures in capturing an intuitive notion of polarization. Our work is an essential step toward quantifying and detecting polarization in ongoing ratings and in benchmark data sets, and to this end, we use our developed polarization detection pipeline to compute the polarization prevalence of several benchmark data sets. It is our hope that this work will contribute to supporting future research in the emerging topic of designing and studying the behavior of recommender systems in polarized environments.
机译:鉴于个性化推荐系统在过滤信息空间中的广泛应用,在极化和偏向的研究中,它们变得越来越重要和重要。极化是一种社会现象,在现实生活中,尤其是在社交媒体上,会带来严重后果。因此,重要的是要了解机器学习算法,特别是推荐系统在极化环境中的行为。在本文中,我们研究了用户与项目空间交互作用下的两极分化及其对推荐系统的影响。我们首先根据项目评分将极化的概念形式化,然后将其与项目评论相关联,以调查任何潜在的相关性。然后,我们提出了一个独立于域的数据科学管道,以使用评级自动检测极化,而不是使用用于检测极化的典型属性(例如项目的内容或社交网络拓扑)自动检测极化。我们在几个基准数据集上对偏振测量进行了广泛的比较,并表明我们的偏振检测框架可以检测不同程度的偏振,并且在捕获直观的偏振概念方面优于现有测量。我们的工作是朝着量化和检测正在进行的等级和基准数据集中的极化的重要步骤,为此,我们使用我们开发的极化检测管线来计算多个基准数据集的极化发生率。我们希望这项工作将有助于支持在极化环境中设计和研究推荐系统行为这一新兴主题方面的未来研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号