Journal: Information Fusion

DOCODE 3.0 (DOcument COpy DEtector): A system for plagiarism detection by applying an information fusion process from multiple documental data sources

Abstract

Plagiarism is the act of presenting someone else's words, thoughts, or ideas as one's own without citing the sources from which they were taken. The exponential growth of digital document sources available on the Web has facilitated the spread of this practice, making its accurate detection a crucial task for educational institutions. In this article, we present DOCODE 3.0, a Web system for educational institutions that automatically analyzes large quantities of digital documents to assess their degree of originality. Since plagiarism is a complex problem, frequently tackled at different levels, our system applies algorithms that perform an information fusion process over multiple data sources at all of these levels. These algorithms have been successfully tested in the scientific community on tasks such as the identification of plagiarized passages and the retrieval of source candidates from the Web and from other data sources such as digital libraries, and have proven to be very effective. We integrate these algorithms into a multi-tier, robust, and scalable JEE architecture, allowing many different types of clients with different requirements to consume our services. For users, DOCODE produces a number of visualizations and reports from the different outputs, giving teachers and professors insight into the originality of the documents they review. This allows them to discover, understand, and handle possible plagiarism cases, and makes analyzing a vast number of documents easier and much faster. Our experience so far has focused on the Chilean context and the Spanish language, offering solutions to Chilean educational institutions in any of their preferred Virtual Learning Environments. However, DOCODE can easily be adapted to increase language coverage. (C) 2015 Elsevier B.V. All rights reserved.
