首页> 外文会议>International Conference on Computer Applications in Industry and Eengineering >Towards Efficient Source Code Plagiarism Detection: An N-gram-based Approach
【24h】

Towards Efficient Source Code Plagiarism Detection: An N-gram-based Approach

机译:朝向有效的源代码抄袭检测:基于N-GRAM的方法

获取原文

摘要

In this day and age, plagiarism has become a serious problem requiring the attention of the academic community at large. The problem, or plague as described in the literature, is very common in written works especially among university students due to various reasons such as time pressure, lack of understanding of what constitutes plagiarism, and the wealth of digital resources available on the Internet which make "copy/paste" activities almost natural! In order to deter students from submitting plagiarized work, educators must have a practical way to detect plagiarism. Currently there are many tools to help educators detect plagiarism within free-text essays but only a few that focus specifically on source code plagiarism. This paper proposes a new technique based on N-grams for this purpose. Our work improves on the state-of-the-art in this area by going beyond simple pair-wise submission comparisons as is the case with almost all techniques in the literature. Furthermore, our approach has empirically demonstrated improved efficiency compared to other approaches in the literature without noticeable sacrifices in accuracy.
机译:在这一天和年龄,抄袭已成为一个严重的问题,需要大的学术界注意。在文献中描述的问题或瘟疫,特别是书面作品,尤其是大学生,由于时间压力,缺乏对构成抄袭的原因以及互联网上可用的数字资源缺乏了解“复制/粘贴”活动几乎很自然!为了阻止学生提交抄袭工作,教育工作者必须有一种无法检测抄袭的方法。目前有许多工具可以帮助教育者在自由文本论文中检测抄袭,但只有一些专注于源代码抄袭的少数人。本文提出了一种基于N-GRAM的新技术为此目的。我们的作品通过超越简单的一对提交比较来提高本领域的最先进,因为几乎所有技术都有几乎所有技术的情况。此外,与文献中的其他方法相比,我们的方法已经明确证明了提高的效率,而没有明显的准确性牺牲。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号