首页> 外文会议>International conference on machine learning and data mining >Detection of Computer-Generated Papers Using One-Class SVM and Cluster Approaches
【24h】

Detection of Computer-Generated Papers Using One-Class SVM and Cluster Approaches

机译:使用一类SVM和聚类方法检测计算机生成的论文

获取原文

摘要

The paper presents a novel methodology intended to distinguish between real and artificially generated manuscripts. The approach employs inherent differences between the human and artificially generated wring styles. Taking into account the nature of the generation process, we suggest that the human style is essentially more "diverse" and "rich" in comparison with an artificial one. In order to assess dissimilarities between fake and real papers, a distance between writing styles is evaluated via the dynamic dissimilarity methodology. From this standpoint, the generated papers are much similar in their own style and significantly differ from the human written documents. A set of fake documents is captured as the training data so that a real document is expected to appear as an outlier in relation to this collection. Thus, we analyze the proposed task in the context of the one-class classification using a one-class SVM approach compared with a clustering base procedure. The provided numerical experiments demonstrate very high ability of the proposed method-ology to recognize artificially generated papers.
机译:本文提出了一种旨在区分真实手稿和人工生成手稿的新颖方法。该方法利用了人类和人工生成的拧紧样式之间的固有差异。考虑到生成过程的本质,我们建议与人工模型相比,人的样式本质上更加“多样化”和“丰富”。为了评估假冒和真实论文之间的差异,通过动态差异方法对写作风格之间的距离进行了评估。从这个角度来看,生成的论文在样式上非常相似,并且与人工撰写的文档有很大的不同。一组伪造的文档被捕获为训练数据,因此,与该集合相比,真实的文档有望表现为离群值。因此,与基于聚类的基本过程相比,我们使用一类SVM方法在一类分类的上下文中分析了拟议的任务。所提供的数值实验证明了所提出的方法论具有很高的识别人工生成论文的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号