Conference: International Conference on Information and Communication Technology

Performance efficiency in plagiarism indication detection system using indexing method with data structure 2–3 tree



Abstract

Plagiarism is a widespread form of cheating, and one way to prevent it is to build an anti-plagiarism system. A system that must compare a query document with every document in the database takes a very long time, and the more irrelevant documents the database contains, the more time is wasted on comparisons. This paper discusses a plagiarism detection system that uses indexing to eliminate irrelevant documents, reducing the set of database documents that must be matched against the query document. Matching between the query document and the database documents is done with the Longest Common Subsequence (LCS) algorithm. The system uses an inverted index, stored in a 2–3 tree data structure, to eliminate irrelevant documents. Indexing is done by inserting each document's fingerprint, and the fingerprints are computed with the winnowing algorithm. The results show that executing 1 query against a corpus of 10000 documents, most of them irrelevant, takes 59 seconds with indexing and 134 seconds without. The F-measure, which combines precision and recall, is 0.7387 with indexing (using 0.15 as the threshold for index-based elimination) and 0.000428 without indexing.
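The pipeline described in the abstract (winnowing fingerprints, an inverted index used to drop irrelevant documents, then LCS matching on the survivors) might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: a plain Python dict stands in for the 2–3 tree, the k-gram length `k`, window size `w`, and CRC32 hash are hypothetical choices, and the 0.15 threshold is interpreted here as the share of query fingerprints found in a document, which the abstract does not specify.

```python
import zlib
from collections import defaultdict


def winnow(text, k=5, w=4):
    """Simplified winnowing: keep the minimum k-gram hash of each window.

    k and w are illustrative values, not taken from the paper.
    """
    s = "".join(ch.lower() for ch in text if ch.isalnum())
    hashes = [zlib.crc32(s[i:i + k].encode()) for i in range(len(s) - k + 1)]
    if not hashes:
        return set()
    if len(hashes) <= w:
        return {min(hashes)}
    return {min(hashes[i:i + w]) for i in range(len(hashes) - w + 1)}


def build_index(docs):
    """Inverted index: fingerprint -> ids of documents containing it.

    A dict is used as a stand-in for the 2-3 tree named in the abstract.
    """
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for fp in winnow(text):
            index[fp].add(doc_id)
    return index


def candidates(index, query, threshold=0.15):
    """Keep documents sharing at least `threshold` of the query fingerprints.

    This reading of the threshold is an assumption.
    """
    q_fps = winnow(query)
    if not q_fps:
        return []
    hits = defaultdict(int)
    for fp in q_fps:
        for doc_id in index.get(fp, ()):
            hits[doc_id] += 1
    return sorted(d for d, n in hits.items() if n / len(q_fps) >= threshold)


def lcs_length(a, b):
    """Length of the longest common subsequence (row-by-row dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


# Toy corpus: only documents that survive the index filter reach the LCS step.
docs = {
    "a": "the quick brown fox jumps over the lazy dog",
    "b": "completely unrelated sentence about winter weather",
}
query = "the quick brown fox jumps over a lazy dog"
index = build_index(docs)
for doc_id in candidates(index, query):
    score = lcs_length(query.split(), docs[doc_id].split())
    print(doc_id, score)
```

The point of the filter is that the quadratic-time LCS comparison runs only on documents that share enough fingerprints with the query, which is consistent with the reported drop from 134 seconds to 59 seconds.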
