首页> 外文会议>IEEE International Conference on Artificial Intelligence and Computer Applications >Code Plagiarism Detection Method Based on Code Similarity and Student Behavior Characteristics
【24h】

Code Plagiarism Detection Method Based on Code Similarity and Student Behavior Characteristics

机译:基于代码相似度和学生行为特征的代码抄袭检测方法

获取原文

摘要

We proposed a plagiarism detection approach based on code similarity and student behavior characteristics in educational scenarios. The traditional plagiarism check is based on the code only, which enables that students can escape inspection by modifying a small amount of code. We proposed that if the behavioral characteristics of students when submitting code can be considered, the suspected plagiarism can be more accurately identified. We proposed the concept of code similarity concentration (SCD) with reference to the Gini coefficient idea. SCD can reflect the similarity distribution between all the codes submitted by a student and others' codes. A large value of SCD means that a student's codes are always the most similar to the codes of some particular classmates. In addition, we also extracted other features to help detection. Finally, we classify the plagiarism detection problem as a binary classification problem and use LightGBM to make decisions. The experimental results show that the accuracy is close to 99% and f1-score is close to 98%.
机译:我们提出了一种基于代码相似性和教育场景中学生行为特征的窃检测方法。传统的gi窃检查仅基于代码,这使学生可以通过修改少量代码来逃脱检查。我们建议,如果可以考虑学生在提交密码时的行为特征,则可以更准确地识别涉嫌抄袭。我们参考基尼系数的想法提出了代码相似度集中度(SCD)的概念。 SCD可以反映出学生提交的所有代码与其他人的代码之间的相似性分布。 SCD的值很大,表示学生的密码始终与某些特定同学的密码最相似。此外,我们还提取了其他功能来帮助检测。最后,我们将pla窃检测问题归类为二进制分类问题,并使用LightGBM做出决策。实验结果表明,该算法的准确度接近99%,f1-score接近98%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号