【24h】

Using string similarity metrics for automated grading of SQL statements

机译:使用字符串相似性度量标准对SQL语句进行自动分级

获取原文

摘要

Manual grading of structured query language (SQL) statements after an exam can be tedious and time consuming for the teaching assistant. Additionally, it can also be subjective to her current state of mind and, thus, prone to errors. In this paper we propose an automated method for grading individual SQL statements. The method uses several common and simple string similarity metrics for comparing the student devised statements against the reference statements. These are then used, along with the manually assigned grades, for building the predictive logistic regression model. The proposed method was evaluated on a dataset consisting of 314 pairs of student-reference statements, along with the discretized average grade assigned by three independent evaluators. The model achieved the expected classification accuracy of 78% on a binary class, thus exhibiting its potential for real-life application. The model can be used as is with the suggested calculated features and reported learnt parameters, or adapted to other examiners' evaluation criteria, presuming their willingness to build manually graded datasets of their own.
机译:考试后对结构化查询语言(SQL)语句进行手动评分可能对助教来说既乏味又耗时。另外,它也可能受她当前的心理状态影响,因此容易出错。在本文中,我们提出了一种用于对单个SQL语句进行评分的自动方法。该方法使用几种常见和简单的字符串相似性度量标准,以将学生设计的语句与参考语句进行比较。然后将这些与人工指定的等级一起用于构建预测逻辑回归模型。所提出的方法是在由314对学生参考陈述组成的数据集上进行评估的,并由三个独立的评估者分配了离散化的平均成绩。该模型在二元分类中达到了预期的78%的分类精度,因此展现了其在实际应用中的潜力。该模型可以与建议的计算特征和报告的学习参数一起使用,也可以根据其他检查者的评估标准进行调整,前提是他们愿意建立自己的手动分级数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号