首页> 外文期刊>Journal of Organizational and End User Computing >Unsupervised Model for Detecting Plagiarism in Internet-based Handwritten Arabic Documents
【24h】

Unsupervised Model for Detecting Plagiarism in Internet-based Handwritten Arabic Documents

机译:基于Internet的阿拉伯手写文档中抄袭的无监督模型

获取原文
获取原文并翻译 | 示例
       

摘要

Due to the rapid increase of internet-based data, there is urgent need for a robust intelligent documents security mechanism. Although there are many attempts to build a plagiarism detection system in natural language documents, the unlimited variation and different writing styles of each character in Arabic documents make building such systems challenging. Based on its position in a word, the same Arabic letter can be written three different ways, which makes the handwritten character recognition a cumbersome process. This article proposes an intelligent unsupervised model to detect plagiarism in these documents called ASTAP. First, a handwritten Arabic character recognition system is proposed using the Grey Wolf Optimization (GWO) algorithm. Then, a modified Abstract Syntax Tree (AST) is used to match the contents of the Arabic documents to detect any similarity. Compared to the state-of-the-art methods, ASTAP improves the effectiveness of the plagiarism detection in terms of the matched similarity ratio, the precision ratio, and the processing time.
机译:由于基于互联网的数据的迅速增长,迫切需要一种强大的智能文档安全机制。尽管已经尝试了许多在自然语言文档中构建detection窃检测系统的尝试,但是阿拉伯文档中每个字符的无限变化和不同的书写风格使构建这样的系统具有挑战性。根据其在单词中的位置,可以用三种不同的方式来书写相同的阿拉伯字母,这使得手写字符识别成为一个繁琐的过程。本文提出了一种智能的无监督模型来检测这些称为ASTAP的文档中的抄袭。首先,提出了一种使用灰狼优化算法的手写阿拉伯字符识别系统。然后,使用修改后的抽象语法树(AST)来匹配阿拉伯文档的内容以检测任何相似性。与最先进的方法相比,ASTAP在匹配相似率,精确率和处理时间方面提高了窃检测的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号