首页> 外文期刊>Aslib Proceedings >Testing the stability of 'wisdom of crowds' judgments of search results over time and their similarity with the search engine rankings
【24h】

Testing the stability of 'wisdom of crowds' judgments of search results over time and their similarity with the search engine rankings

机译:测试随着时间推移“人群智慧”判断结果的稳定性以及它们与搜索引擎排名的相似性

获取原文
获取原文并翻译 | 示例
       

摘要

Purpose - One of the under-explored aspects in the process of user information seeking behaviour is influence of time on relevance evaluation. It has been shown in previous studies that individual users might change their assessment of search results over time. It is also known that aggregated judgements of multiple individual users can lead to correct and reliable decisions; this phenomenon is known as the "wisdom of crowds". The purpose of this paper is to examine whether aggregated judgements will be more stable and thus more reliable over time than individual user judgements. Design/methodology/approach - In this study two simple measures are proposed to calculate the aggregated judgements of search results and compare their reliability and stability to individual user judgements. In addition, the aggregated "wisdom of crowds" judgements were used as a means to compare the differences between human assessments of search results and search engine's rankings. A large-scale user study was conducted with 87 participants who evaluated two different queries and four diverse result sets twice, with an interval of two months. Two types of judgements were considered in this study: relevance on a four-point scale, and ranking on a ten-point scale without ties. Findings - It was found that aggregated judgements are much more stable than individual user judgements, yet they are quite different from search engine rankings. Practical implications - The proposed "wisdom of crowds"-based approach provides a reliable reference point for the evaluation of search engines. This is also important for exploring the need of personalisation and adapting search engine's ranking over time to changes in users preferences. Originality/value - This is a first study that applies the notion of "wisdom of crowds" to examine an under-explored in the literature phenomenon of "change in time" in user evaluation of relevance.
机译:目的-在用户信息搜索行为过程中,有待探索的方面之一是时间对相关性评估的影响。先前的研究表明,单个用户可能会随着时间的推移更改其对搜索结果的评估。众所周知,多个用户的综合判断可以导致正确和可靠的决策;这种现象被称为“人群的智慧”。本文的目的是检验聚合的判断是否比单个用户的判断更稳定,从而在时间上更可靠。设计/方法/方法-在这项研究中,提出了两种简单的措施来计算搜索结果的综合判断,并将其可靠性和稳定性与单个用户的判断进行比较。此外,汇总的“人群的智慧”判断被用作比较人类对搜索结果的评估与搜索引擎排名之间差异的一种手段。对87位参与者进行了大规模的用户研究,他们两次对两个不同的查询和四个不同的结果集进行了两次评估,时间间隔为两个月。在本研究中考虑了两种类型的判断:四点量表的相关性和十点量表的无联系等级。调查结果-发现综合判断比单个用户的判断稳定得多,但与搜索引擎排名却大不相同。实际意义-提议的基于“人群智慧”的方法为评估搜索引擎提供了可靠的参考点。这对于探索个性化需求以及使搜索引擎的排名随时间变化以适应用户偏好的变化也很重要。原创性/价值-这是第一个应用“人群的智慧”概念研究在用户相关性评估中未充分挖掘的“时间变化”现象的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号