首页> 外文期刊>International journal of agent technologies and systems >Multi-Agents Machine Learning (MML) System for Plagiarism Detection
【24h】

Multi-Agents Machine Learning (MML) System for Plagiarism Detection

机译:Pla窃检测的多代理商机器学习(MML)系统

获取原文
获取原文并翻译 | 示例
       

摘要

Day after day the cases of plagiarism increase and become a crucial problem in the modern world caused by the quantity of textual information available in the web. Data mining becomes the foundation for many different domains as one of its chores is the text categorization, which can be used in order to resolve the impediment of automatic plagiarism detection. This article is devoted to a new approach for combating plagiarism named MML (Multi-agents Machine learning system) and is composed of three modules: data preparation and digitalization, using n-gram character or bag of words as methods for the text representation; TF*IDF as weighting to calculate the importance of each term in the corpus in order to transform each document to a vector; and learning and voting phase using three supervised learning algorithms (decision tree c4.5, naive Bayes and support vector machine).
机译:由于网络上可用的文本信息数量众多,抄袭案件日渐增多,并成为现代世界中的一个关键问题。数据挖掘成为许多不同领域的基础,因为其繁琐的工作之一就是文本分类,可用于解决自动抄袭检测的​​障碍。本文专门介绍一种称为MML(Multi-agents机器学习系统)的打击窃的新方法,它由三个模块组成:数据准备和数字化,使用n-gram字符或单词袋作为文本表示方法; TF * IDF作为权重,用于计算语料库中每个术语的重要性,以便将每个文档转换为向量;使用三种监督学习算法(决策树c4.5,朴素贝叶斯和支持向量机)进行学习和投票。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号