Journal: The international journal of engineering education

Metrics for Estimating Validity, Reliability and Bias in Peer Assessment


Abstract

Peer assessment is a widespread way of evaluating and rating the quality of work in education. Although it has proven to be a very effective learning instrument, it is subject to possible problems of reliability and validity and to some potential biases. Most works that study and try to solve these problems focus on specific cases, and the statistics they use for measuring reliability, validity or bias are global: they give a measure of these values for the whole process, but they do not allow the study of individual reviewers. This work takes a different approach. It proposes metrics for the reliability and validity of each reviewer, as well as an approximation to the possible biases that may appear in the assessment process, so that the review process can itself be assessed. An analogy is proposed between the work of a reviewer in a peer assessment process and the operation of an automatic classifier. This has allowed us to use the usual measures for evaluating the quality of automatic classifiers to establish the quality of peer assessment. The reviewers are characterized by their confusion matrices and six new indicators: success rate (which estimates validity); agreement degree (as a measure of reliability); assessment median and its interquartile range (for estimating central tendency and restriction of range biases); and average distance to the diagonal and its standard deviation (to determine possible leniency and harshness biases). The method provides indicators of each reviewer's performance and supports the detection of different reviewer profiles, so that the teacher can assess the work of the students as reviewers and introduce correction mechanisms in the final assessment of the works. A practical example of application to an engineering degree is provided to illustrate the potential of the method.
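The abstract names the indicators but does not give their formulas, so the following Python sketch only illustrates how a per-reviewer confusion matrix and the six indicators might be computed. Everything specific here is an assumption made for illustration, not the authors' definition: grades are integers on an ordinal scale from 0 to n_levels - 1, validity is measured against a reference grade (for example the teacher's), reliability is measured against a hypothetical peer consensus grade, and the distance to the diagonal is taken as the signed difference between the reviewer's grade and the reference grade. The function and variable names are likewise hypothetical.

```python
from statistics import mean, median, pstdev, quantiles


def confusion_matrix(reference, assigned, n_levels):
    """Rows index the reference grade, columns the grade the reviewer gave."""
    matrix = [[0] * n_levels for _ in range(n_levels)]
    for ref, got in zip(reference, assigned):
        matrix[ref][got] += 1
    return matrix


def reviewer_indicators(reference, peer_consensus, assigned, n_levels):
    """Assumed (not the paper's) definitions of the six indicators."""
    matrix = confusion_matrix(reference, assigned, n_levels)
    total = len(assigned)

    # Success rate: share of grades on the diagonal (matches the reference).
    success_rate = sum(matrix[i][i] for i in range(n_levels)) / total

    # Agreement degree: share of grades matching the assumed peer consensus.
    agreement = sum(a == p for a, p in zip(assigned, peer_consensus)) / total

    # Central tendency and spread of the grades this reviewer hands out.
    q1, _, q3 = quantiles(assigned, n=4, method="inclusive")

    # Signed distance to the diagonal: positive means more lenient than
    # the reference, negative means harsher.
    distances = [a - r for r, a in zip(reference, assigned)]

    return {
        "confusion_matrix": matrix,
        "success_rate": success_rate,
        "agreement_degree": agreement,
        "median": median(assigned),
        "iqr": q3 - q1,
        "mean_distance": mean(distances),
        "std_distance": pstdev(distances),
    }


# Made-up grades for one reviewer over six works, on a 4-level scale.
reference = [0, 1, 2, 3, 2, 1]
peer_consensus = [0, 1, 2, 3, 3, 1]
assigned = [1, 2, 2, 3, 3, 2]

result = reviewer_indicators(reference, peer_consensus, assigned, n_levels=4)
for name, value in result.items():
    print(name, value)
```

Under these assumptions, a reviewer whose mean distance is clearly positive with a small standard deviation would look consistently lenient, while a narrow interquartile range around a mid-scale median would suggest a central tendency or restriction of range pattern. The thresholds for such profiles are not given in the abstract and would have to be chosen separately.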
