首页> 外文会议>Conference on empirical methods in natural language processing >Intrinsic Plagiarism Detection using N-gram Classes
【24h】

Intrinsic Plagiarism Detection using N-gram Classes

机译:使用N-gram类进行内在Pla窃检测

获取原文

摘要

When it is not possible to compare the suspicious document to the source document(s) plagiarism has been committed from, the evidence of plagiarism has to be looked for intrinsically in the document itself. In this paper, we introduce a novel language-independent intrinsic plagiarism detection method which is based on a new text representation that we called n-gram classes. The proposed method was evaluated on three publicly available standard corpora. The obtained results are comparable to the ones obtained by the best state-of-the-art methods.
机译:如果无法将可疑文件与源文件进行窃,则必须在文件本身中内在地寻找窃的证据。在本文中,我们介绍了一种新颖的,独立于语言的固有抄袭检测方法,该方法基于称为n-gram类的新文本表示形式。对三种公开的标准语料库对提出的方法进行了评估。所获得的结果可与通过最佳最新技术方法获得的结果相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号