Text Mining for Plagiarism Detection: Multivariate Pattern Detection for Recognition of Text Similarities

机译：抄袭检测的文本挖掘：用于识别文本相似性的多元模式检测

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The problem of plagiarism the recent years has been intensified by the availability of information in digital form and the accessibility of the electronic libraries through the Internet. As a result, plagiarism detection has been transformed into a big data analytics problem since the number of digital sources is extravagant and a new document needs to be compared with millions of other existing documents. In this paper, a text mining methodology is proposed that can detect all common patterns between a document and the documents in a reference database. The technique is based on a pattern detection algorithm and the corresponding data structure that enables the algorithm to detect all common patterns. The methodology has been applied in a well-defined dataset providing very promising results identifying difficult cases of plagiarism such as technical disguise.

机译：近年来，由于存在数字形式的信息以及电子图书馆可以通过互联网访问，窃问题变得更加严重。结果，since窃检测已转化为大数据分析问题，因为数字资源的数量非常庞大，并且需要将新文档与数百万其他现有文档进行比较。本文提出了一种文本挖掘方法，可以检测文档和参考数据库中文档之间的所有常见模式。该技术基于模式检测算法和使该算法能够检测所有常见模式的相应数据结构。该方法已应用于定义明确的数据集中，提供了非常有希望的结果，可识别出诸如技术伪装之类的difficult窃困难案例。

著录项

来源
《IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining;International Symposium on Foundations of Open Source Intelligence and Security Informatics;International Symposium on Foundations and Applications of Big Data Analytics;International Symposium on Network Enabled Health Informatics, Biomedicine and Bioinformatics》|2018年|938-945|共8页
会议地点 Barcelona(ES)
作者
Konstantinos Xylogiannopoulos; Panagiotis Karampelas; Reda Alhajj;
展开▼
作者单位

University of Calgary Department of Computer Science Calgary Canada;

Division of Informatics Computer Hellenic Air Force Academy Dekelia Greece;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Plagiarism; Libraries; Text mining; Data structures; Internet; Big Data; Tools;

机译：gi窃；图书馆；文本挖掘；数据结构;互联网;大数据;工具类;

相似文献

外文文献
中文文献
专利

1. Role of Text Mining in Detection of Plagiarism in Arabic Texts: An Architectural Perspective [J] . Abdullah Al Hussein Research journal of applied science, engineering and technology . 2016,第4期

机译：文本挖掘在抄袭阿拉伯文本中的作用：建筑学的角度
2. Role of Text Mining in Detection of Plagiarism in Arabic Texts: An Architectural Perspective [J] . Abdullah Al Hussein Research journal of applied science, engineering and technology . 2016,第4期

机译：文本挖掘在抄袭阿拉伯文本中的作用：建筑学的角度
3. Text Plagiarism Detection Method Based On Path Patterns [J] . Chun Kit See, Kuok-Shoong Wong, Wei Lee Woon International Journal of Business Intelligence and Data Mining . 2008,第2期

机译：基于路径模式的文本抄袭检测方法
4. Text Mining for Plagiarism Detection: Multivariate Pattern Detection for Recognition of Text Similarities [C] . Konstantinos Xylogiannopoulos, Panagiotis Karampelas, Reda Alhajj IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining . 2018

机译：抄袭检测的文本挖掘：多变量模式检测识别文本相似度
5. An Automatic Similarity Detection Engine Between Sacred Texts Using Text Mining and Similarity Measures [D] . Qahl, Salha Hassan Muhammed. 2014

机译：使用文本挖掘和相似度度量的神圣文本之间的自动相似度检测引擎
6. Implementation and comparison of two text mining methods with a standard pharmacovigilance method for signal detection of medication errors [O] . Nadine Kadi Eskildsen, Robert Eriksson, Sten B. Christensen, 2020

机译：两种文本挖掘方法与用于药物错误信号检测的标准药物警戒方法的实现和比较
7. Context Similarity Strategy for Text Data Plagiarism Detection [O] . Durga Bhavani Dasari, Dr Venu Gopala Rao. K 2018

机译：文本数据抄袭检测的上下文相似策略

Text Mining for Plagiarism Detection: Multivariate Pattern Detection for Recognition of Text Similarities

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅