首页> 美国政府科技报告 >Experimental Design for Measuring the Intra- and Inter-Group Consistency of Human Judgment of Relevance
【24h】

Experimental Design for Measuring the Intra- and Inter-Group Consistency of Human Judgment of Relevance

机译:测量人类相关性判断的组内和组间一致性的实验设计

获取原文

摘要

The suspected variability of humans in judging the relevance of documents is one of the current problems confronting the development and improvement of document information and retrieval systems. The purpose of this thesis was to design a method to investigate the variation of relevance judgments between two groups of analysts and among the analysts within each group. A pilot experiment was conducted using two groups of analysts (subject experts and non-experts) and two question-document collections (machine retrieved and randomly selected). Analysts were instructed to mark each document relevant or not-relevant to the given question and to record the time required to make such relevance assessments. The responses were analyzed statistically. The data permitted the following conclusions: (1) the analysts within the groups could consistently agree on the relevance of documents to questions; (2) the degree of consistency of the two groups did not differ significantly; (3) the two groups did agree on the relevance of a particular document to a question; and (4) the method of document selection had a serious effect only on the consistency of the group of non-experts.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号