首页> 外文期刊>Computer Science and Information Technology >An Improvement of Plagiarized Area Detection System Using Jaccard Correlation Coefficient Distance Algorithm
【24h】

An Improvement of Plagiarized Area Detection System Using Jaccard Correlation Coefficient Distance Algorithm

机译:基于Jaccard相关系数距离算法的Pla窃区域检测系统的改进。

获取原文
       

摘要

In this paper, a plagiarized area detection system is proposed in which Jaccard correlation coefficient is used for filtering to improve the processing time against huge volume of documents. Hence, the proposed system does filter to efficiently detect plagiarized area against huge volume of original documents by two algorithms; Jaccard coefficient distance algorithm and Cosine distance algorithm. Since Jaccard coefficient distance algorithm computes the distance between two document based only on the existence of words while Cosine distance algorithm uses word's frequency also, Jaccard coefficient distance algorithm is faster than Cosine one. Hence, for the efficiency, we use Jaccard coefficient distance algorithm as the first filter. According to the experiment result of the performance comparison between the proposed system and the previous our system, the newly proposed system outperforms the previous one with about 30% reduced processing time.
机译:本文提出了一种J窃区域检测系统,该系统利用雅卡德相关系数进行滤波,以提高处理大量文件的时间。因此,所提出的系统通过两种算法进行过滤,以针对大量原始文件有效地检测出area窃区域。雅卡德系数距离算法和余弦距离算法。由于Jaccard系数距离算法仅根据单词的存在来计算两个文档之间的距离,而Cosine距离算法也使用单词的频率,因此Jaccard系数距离算法要比Cosine更快。因此,为了提高效率,我们使用Jaccard系数距离算法作为第一个滤波器。根据所提出的系统与以前的系统之间的性能比较的实验结果,新提出的系统优于先前的系统,处理时间减少了约30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号