Journal: Web Intelligence and Agent Systems

SimPaD: A word-similarity sentence-based plagiarism detection tool on Web documents

Abstract

Plagiarism is a serious problem: it infringes copyrighted documents and materials, is unethical, and reduces the economic incentive received by their legal owners. The problem is growing worse as the number of online publications increases and easy access on the Web makes it simpler to locate and paraphrase information. To address this problem, we propose a novel plagiarism-detection method, called SimPaD, which (i) establishes the degree of resemblance between any two documents D1 and D2 based on their sentence-to-sentence similarity, computed using pre-defined word-correlation factors, and (ii) generates a graphical view of the sentences that are similar (or the same) in D1 and D2. Experimental results verify that SimPaD is highly accurate in detecting (non-)plagiarized documents and outperforms existing plagiarism-detection approaches.
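The abstract does not give SimPaD's formulas, so the following is only a minimal sketch of the general idea of scoring two documents by sentence-to-sentence similarity built on word-correlation factors. Everything specific here is an assumption made for illustration: the toy `CORRELATION` table, the max-then-average scoring in `sentence_sim`, the `threshold` value, and all function names. None of it is taken from the paper.

```python
import re

# Hypothetical toy word-correlation factors in [0, 1], treated as symmetric.
# The real pre-defined factors used by SimPaD are not given in the abstract.
CORRELATION = {
    frozenset(("car", "automobile")): 0.9,
    frozenset(("fast", "quick")): 0.8,
}


def word_corr(a, b):
    """Correlation of two words: 1.0 if identical, else a table lookup (0.0 if absent)."""
    if a == b:
        return 1.0
    return CORRELATION.get(frozenset((a, b)), 0.0)


def tokenize(sentence):
    """Lowercase a sentence and keep only alphabetic tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def sentence_sim(s1, s2):
    """Assumed scoring: average best-match correlation, taken in both directions."""
    w1, w2 = tokenize(s1), tokenize(s2)
    if not w1 or not w2:
        return 0.0

    def directed(src, dst):
        return sum(max(word_corr(a, b) for b in dst) for a in src) / len(src)

    # Take the minimum so a short sentence contained in a long one scores lower.
    return min(directed(w1, w2), directed(w2, w1))


def split_sentences(doc):
    """Naive sentence splitter on '.', '!' and '?'."""
    return [s.strip() for s in re.split(r"[.!?]+", doc) if s.strip()]


def resemblance(d1, d2, threshold=0.7):
    """Return (fraction of d1 sentences matched in d2, list of (i, j, score) matches)."""
    s1, s2 = split_sentences(d1), split_sentences(d2)
    if not s1 or not s2:
        return 0.0, []
    matches = []
    for i, a in enumerate(s1):
        # Best-scoring sentence of d2 for sentence i of d1.
        j, score = max(
            ((j, sentence_sim(a, b)) for j, b in enumerate(s2)),
            key=lambda pair: pair[1],
        )
        if score >= threshold:
            matches.append((i, j, score))
    return len(matches) / len(s1), matches


if __name__ == "__main__":
    d1 = "The car is fast. I like tea."
    d2 = "The automobile is quick. Birds sing loudly."
    degree, pairs = resemblance(d1, d2)
    print(degree, pairs)
```

The `(i, j, score)` pairs returned by `resemblance` are the kind of data a side-by-side view of similar sentences could be drawn from, which is how this sketch relates to point (ii) of the abstract.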
