首页> 外文会议>IEEE International Conference on computer supported cooperative work in design >A Crowd Science framework to support the construction of a Gold Standard Corpus for Plagiarism Detection
【24h】

A Crowd Science framework to support the construction of a Gold Standard Corpus for Plagiarism Detection

机译:人群科学框架,支持建设金标准语料库的抄袭检测

获取原文

摘要

The construction of a Gold Standard Corpus for Plagiarism Detection (GSCPD) is a challenging task for reproducible research in computer science, given that there is a trade off between the time expended by the experts and the size, quality, and reliability of a GSCPD. In such a challenging scenario, this paper describes a framework to support the construction of a GSCPD in any language. Aiming for reproducibility and scalability, the framework involves a data acquisition process and a Crowd Science project that employs human processing power to identify plagiarism in pairs of textual data extracted via the data acquisition process. This papers also presents the application of this framework in Portuguese language and the preliminary results of a feasibility study about the use of a tool that composes the framework.
机译:鉴于计算机科学的可重复研究,建设抄袭检测(GSCPD)的建设是一个具有挑战性的计算机科学研究,因为专家的时间和GSCPD的规模,质量和可靠性之间存在折扣。在这种具有挑战性的场景中,本文介绍了一种支持任何语言构建GSCPD的框架。针对可重复性和可扩展性,该框架涉及数据采集过程和人群科学项目,该项目采用人类处理能力,以通过数据采集过程提取的对文本数据成对的抄袭。本文还介绍了本框架在葡萄牙语中的应用以及关于使用组成框架的工具的可行性研究的初步结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号