Conference: IEEE International Conference on Intelligent Computing and Information Systems
FPSS: Fingerprint-based semantic similarity detection in big data environment

Abstract

Although plagiarism is an old problem that predates the Internet revolution, the free and easy availability of electronic papers on the Internet has complicated and aggravated it. Many systems exist for detecting plagiarism in natural language documents. However, unlike letters in Latin-script documents, the same Arabic letter can be written in three different ways depending on its position in the word. This complexity of written Arabic makes building such a system a major challenge. Accordingly, this paper presents a fingerprint-based semantic similarity detection system, called FPSS, to detect plagiarism in Arabic documents. It generates a digital fingerprint (df) for each sentence and compares all the df values. Moreover, it analyzes corresponding detection schemes to detect semantic similarity effectively. FPSS improves effectiveness in terms of the matched similarity ratio, precision, recall, F-measure, plagdet, and granularity.
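
The abstract does not say how the per-sentence fingerprint is computed or compared, so the following is only a minimal sketch of the general idea under stated assumptions, not the FPSS method itself. It assumes that positional letter forms appear as Unicode presentation-form code points (as they sometimes do in extracted text) and folds them to base letters with NFKC normalization, that a fingerprint is a set of hashed word 3-grams, and that two fingerprints are compared with Jaccard overlap. The names `normalize`, `fingerprint`, and `similarity` are hypothetical. This captures lexical overlap only; the semantic analysis mentioned in the abstract is not modeled.

```python
from __future__ import annotations

import hashlib
import re
import unicodedata


def normalize(sentence: str) -> list[str]:
    """Fold presentation forms to base letters, drop diacritics, tokenize."""
    # NFKC maps Arabic presentation forms (initial/medial/final glyph
    # code points) back to their base letters.
    text = unicodedata.normalize("NFKC", sentence)
    # Remove Arabic diacritics (U+064B..U+0652) and tatweel (U+0640).
    text = re.sub(r"[\u064B-\u0652\u0640]", "", text)
    return re.findall(r"\w+", text.lower())


def fingerprint(sentence: str, k: int = 3) -> frozenset[int]:
    """Assumed fingerprint: the set of 64-bit hashes of word k-grams."""
    tokens = normalize(sentence)
    if not tokens:
        return frozenset()
    if len(tokens) < k:
        shingles = [" ".join(tokens)]  # short sentence: hash it whole
    else:
        shingles = [" ".join(tokens[i:i + k])
                    for i in range(len(tokens) - k + 1)]
    return frozenset(
        int.from_bytes(hashlib.sha1(s.encode("utf-8")).digest()[:8], "big")
        for s in shingles
    )


def similarity(a: frozenset[int], b: frozenset[int]) -> float:
    """Jaccard overlap of two fingerprints (an assumed comparison rule)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)
```

For instance, `similarity(fingerprint(s1), fingerprint(s2))` is 1.0 for two sentences with identical normalized word 3-grams and 0.0 when they share none; a detection threshold between those values would have to be tuned on labeled data.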