Journal: Medical Education

Reliability estimates: Behavioural stations and questionnaires in medical school admissions

Abstract

Context: Assessment centres used in evaluating the non-cognitive attributes of medical school candidates must generate scores that reflect as accurate a measurement as possible of these attributes. Thus far, reliability coefficients for such centres have been based on limited samples and individual administrations, without reference to the error of variance that may result from retesting, or from the existence of multiple centres designed to measure the same attributes.

Methods: The National Institute for Testing and Evaluation in Israel has developed and administered two assessment centres: MOR is used by two medical schools and one dental school, and MIRKAM by another medical school. Each centre comprises eight or nine behavioural stations, a standardised biographical questionnaire, and a judgement and decision-making questionnaire. We calculated generalisability coefficients for each centre's eight or nine stations by year, composite reliability coefficients for the overall assessment centres, test-retest correlation coefficients for repeaters, and a correlation coefficient between the centres.

Results: Between 2006 and 2009, 2662 and 2023 examinees participated in MOR and MIRKAM, respectively; 1479 of these participated in both. The average generalisability coefficients for the stations were 0.69 for MOR and 0.67 for MIRKAM. The composite reliability coefficients for the full centres (behavioural stations plus questionnaires) were 0.79 and 0.76 for MOR and MIRKAM, respectively. The correlations for repeaters, corrected for restriction of range, were 0.59 and 0.43 for MOR and MIRKAM stations, respectively, and 0.72 and 0.65 for the full MOR and MIRKAM assessments, respectively. The correlation between scores on the MOR and MIRKAM stations was 0.56 (0.75 for the overall score).

Discussion: The minimal reliability desirable for high-stakes decision making (0.80) was obtained only for 14 or 15 stations with questionnaires. Nevertheless, the values obtained are considerably higher than reliability coefficients for single interviews. The questionnaires contribute significantly to the accuracy of the measurement. These reliability measures constitute an upper threshold for measures of validity.
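
As a rough illustration of the arithmetic behind the Discussion, the sketch below applies the Spearman-Brown prophecy formula to the station-only generalisability coefficients reported above. This is an assumption for illustration only: the abstract does not say which projection method the authors used, Spearman-Brown treats all stations as parallel, and the sketch ignores the questionnaires, so its station counts need not match the "14 or 15 stations with questionnaires" figure. The function names are hypothetical. The last helper shows the classical test theory bound in which a score's correlation with any criterion cannot exceed the square root of its reliability.

```python
import math


def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when a test is lengthened by length_factor."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)


def length_factor_for(reliability: float, target: float) -> float:
    """Lengthening factor needed to move from reliability to target."""
    return target * (1 - reliability) / (reliability * (1 - target))


def validity_ceiling(reliability: float) -> float:
    """Classical upper bound on a validity coefficient: sqrt(reliability)."""
    return math.sqrt(reliability)


# Station-only coefficients reported in the abstract.
# Assumption: each value is applied to both an 8-station and a 9-station centre.
for name, coefficient in (("MOR", 0.69), ("MIRKAM", 0.67)):
    factor = length_factor_for(coefficient, 0.80)
    for n_stations in (8, 9):
        needed = math.ceil(n_stations * factor)
        print(f"{name}: {n_stations} stations at {coefficient:.2f} "
              f"-> about {needed} stations for 0.80 (stations only)")

# Composite reliabilities of the full centres reported in the abstract.
for name, composite in (("MOR", 0.79), ("MIRKAM", 0.76)):
    print(f"{name}: validity ceiling = {validity_ceiling(composite):.2f}")
```

Because the questionnaires add reliable variance of their own, a full centre would be expected to reach 0.80 with fewer stations than this station-only projection suggests.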
