...
首页> 外文期刊>Information Technology Journal >Arabic-English Cross-language Plagiarism Detection using Winnowing Algorithm
【24h】

Arabic-English Cross-language Plagiarism Detection using Winnowing Algorithm

机译:基于Winnowing算法的阿拉伯-英语跨语言lang窃检测

获取原文
获取原文并翻译 | 示例
           

摘要

The availability of information in electronic forms and the availability of automatic translation machines has led to increased cross-language plagiarism. Manual detection of cross-language plagiarism is difficult, as such, developing an automatic system to detect such plagiarism is necessary. Although, there are a number of studies on detecting cross-language plagiarism in the form of Euro-English, Malay-English and Indonesian-English, there remains few studies concerned with the detection of Arabic-English cross-language plagiarism. This study proposes an Arabic-English cross-language plagiarism detection tool using the Winnowing algorithm. We evaluate its performance in terms of precision and recall on a data set consisting of Wikipedia articles. The performance of the proposed tool proved good with 97% precision, 81% recall and 89% F-measure evaluation metrics. The results show that the Winnowing algorithm can be used effectively to detect Arabic-English cross-language plagiarism.
机译:电子形式的信息可用性和自动翻译机的可用性导致跨语言窃的增加。手动检测跨语言窃是困难的,因此,有必要开发一种自动系统来检测这种窃。尽管以欧洲英语,马来英语和印尼英语为形式的关于检测跨语言窃的研究很多,但很少有关于检测阿拉伯英语跨语言窃的研究。这项研究提出了一种使用Winnowing算法的阿拉伯语-英语跨语言窃检测工具。我们根据准确性来评估其性能,并在由Wikipedia文章组成的数据集上进行召回。事实证明,所提出工具的性能良好,具有97%的精度,81%的召回率和89%的F-measure评估指标。结果表明,Winnowing算法可以有效地检测阿拉伯-英语跨语言抄袭。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号