Conference: International Conference on Document Analysis and Recognition

Achieving Linguistic Provenance via Plagiarism Detection

Abstract

To go beyond what current provenance systems can capture for natural language text documents, we propose the Lincoln Laboratory Plagiarism for Provenance System (LLPla) as an approach for capturing linguistic provenance. Linguistic provenance infers the origin of text based on its linguistic structure. We take a plagiarism detection approach to this task, because identifying similar sections of text is fundamental to linguistic provenance and central to LLPla's performance. Thus, to determine the most viable plagiarism detection algorithm for use in LLPla, we evaluate three state-of-the-art plagiarism detection algorithms. Moreover, we propose extensions to the best-performing algorithm that improve its precision with negligible effects on recall.
