首页> 外文会议>International Conference on Digital Information Management >Algorithm of the longest commonly consecutive word for Plagiarism detection in text based document

【24h】

Algorithm of the longest commonly consecutive word for Plagiarism detection in text based document

机译：基于文本文档中抄袭检测的最长常见连续词的算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Plagiarism is a form of academic misconduct which has increased with the easy access to obtain information through electronic documents and the Internet. The problem of finding document plagiarism in full text document can be viewed as a problem of finding the longest common parts of strings. Moreover, the detection system has to be capable to determine and visualize not only the common parts but also the location of the common parts in both the source and the observed document. Unlike previous research, this paper proposes a numerical based comparison algorithm that is comparable in the computation time without loosing the word order of common parts. Based on the experiment, the proposed algorithm outperforms the suffix tree in the length of observed paragraph below one hundred words.

机译：剽窃是一种学术不当行为的形式，随着通过电子文件和互联网获取信息而增加。在全文文档中查找文档抄袭的问题可以被视为找到字符串最长的常见部分的问题。此外，检测系统必须能够实力地确定和可视化源部和观察文档中的公共部分的位置。与以前的研究不同，本文提出了一种基于数值基于比较算法，其在计算时间中可比，而不减少公共部分的字阶。基于实验，所提出的算法在观察到的段落之后的后缀树上优于一百个单词的长度。

著录项

来源
《International Conference on Digital Information Management 》|2008年||共7页
会议地点
作者
Sediyono Agung; Mahamud Ku Ruhana Ku-;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Using word semantic concepts for plagiarism detection in text documents [J] . Chang Chia-Yang, Lee Shie-Jue, Wu Chih-Hung, Information retrieval . 2021 ,第4a5期

机译：在文本文件中使用Word语义概念进行抄袭检测
2. Implementation of Winnowing Algorithm Based K-Gram to Identify Plagiarism on File Text-Based Document [J] . Yanuar Nurdiansyah, Fiqih Nur Muharrom, Firdaus MATEC Web of Conferences . 2018 ,第1期

机译：基于K-Gram的Winnowing算法识别基于文本文件的抄袭
3. Implementation of Winnowing Algorithm Based K-Gram to Identify Plagiarism on File Text-Based Document [J] . Yanuar Nurdiansyah, Fiqih Nur Muharrom, Firdaus MATEC Web of Conferences . 2018 ,第1期

机译：基于K-Gram的Winnowing算法识别基于文本文件的抄袭
4. Algorithm of the longest commonly consecutive word for Plagiarism detection in text based document [C] . Sediyono Agung, Mahamud Ku Ruhana Ku- International Conference on Digital Information Management . 2008

机译：基于文本文档中抄袭检测的最长常见连续词的算法
5. Mono- and Cross-Lingual Paraphrased Text Reuse and Extrinsic Plagiarism Detection [D] . Sharjeel, Muhammad. 2020

机译：单次和交叉语言解读文本重用和外在抄袭检测
6. Early detection of internet trolls: Introducing an algorithm based on word pairs / single words multiple repetition ratio [O] . Sergei Monakhov, Alexandre Bovet, Alexandre Bovet, 2020

机译：早期检测互联网巨魔：引入基于词对/单词多个重复率的算法
7. A Plagiarism Detection System for Malayalam Text Based Documents with Full and Partial Copy [O] . Sindhu L., Idicula Sumam Mary 2016

机译：带有全部和部分副本的马拉雅拉姆文字文件的gi窃检测系统
8. Automated Energy Detection Algorithm Based on Consecutive Mean Excision. [R] . Tom, K. F. 2018

机译：基于连续平均切除的自动能量检测算法。

Algorithm of the longest commonly consecutive word for Plagiarism detection in text based document

摘要

著录项

相似文献

相关主题

期刊订阅