首页> 外文会议>ANLP 2011 >Lexical Generalisation for Word-level Matching in Plagiarism Detection
【24h】

Lexical Generalisation for Word-level Matching in Plagiarism Detection

机译:词汇普遍匹配在抄袭检测中的词汇概括

获取原文

摘要

Plagiarism has always been a concern in many sectors, particularly in education. With the sharp rise in the number of electronic resources available online, an increasing number of plagiarism cases has been observed in recent years As the amount of source materials is vast, the use of plagiarism detection tools has become the norm to aid the investigation of possible plagiarism cases. This paper describes an approach to improve plagiarism detection by incorporating a lexical generalisation technique. The goal is to identify plagiarised texts even if they are paraphrased using different words. Experiments performed on a subset of the PAN'10 corpus show that the matching approach involving lexical generalisation yields promising results, as compared to standard n-gram matching strategies.
机译:抄袭始终是许多部门的关注,特别是在教育中。随着在线获得的电子资源数量急剧上升,近年来越来越多的抄袭病例随着源材料的数量庞大,抄袭检测工具的使用已经成为帮助可能的调查的准则抄袭病例。本文介绍了一种通过纳入词汇概括技术来改善抄袭检测的​​方法。目标是识别抄袭文本,即使它们是使用不同的单词释放的。在Pan'10语料库的子集上进行的实验表明,与标准的N-GRAM匹配策略相比,涉及词汇概括的匹配方法产生了有希望的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号