An IR-based Approach Utilising Query Expansion for Plagiarism Detection in MEDLINE

Abstract

The identification of duplicated and plagiarised passages of text has become an increasingly active area of research. In this paper we investigate methods for plagiarism detection that aim to identify potential sources of plagiarism from MEDLINE, particularly when the original text has been modified through the replacement of words or phrases. A scalable approach based on Information Retrieval is used to perform candidate document selection - the identification of a subset of potential source documents given a suspicious text - from MEDLINE. Query expansion is performed using the UMLS Metathesaurus to deal with situations in which original documents are obfuscated. Various approaches to Word Sense Disambiguation are investigated to deal with cases where there are multiple Concept Unique Identifiers (CUIs) for a given term. Results using the proposed IR-based approach outperform a state-of-the-art baseline based on Kullback-Leibler Distance.
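The candidate selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the retrieval model, the expansion weighting, or the disambiguation method, so everything below is an assumption made for illustration: a TF-IDF cosine ranking stands in for the IR engine, a tiny hand-written `SYNONYMS` table stands in for UMLS Metathesaurus lookups, and the document collection `DOCS` is invented toy data rather than MEDLINE. Word Sense Disambiguation is not modelled at all.

```python
import math
import re
from collections import Counter

# Hypothetical synonym table (assumption): a real system would map terms to
# UMLS concepts (CUIs) and pull in the synonyms of the chosen concept.
SYNONYMS = {
    "kidney": ["renal"],
    "tumour": ["neoplasm", "tumor"],
}

# Invented toy collection (assumption), standing in for MEDLINE abstracts.
DOCS = {
    "d1": "renal neoplasm growth in adult patients",
    "d2": "cardiac arrest outcomes after surgery",
    "d3": "dietary habits of adult patients",
}

# A "suspicious" text whose wording was changed by word replacement.
SUSPICIOUS = "kidney tumour growth in adults"


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def expand_query(tokens, synonyms):
    """Append the listed synonyms of every query token to the query."""
    expanded = list(tokens)
    for tok in tokens:
        expanded.extend(synonyms.get(tok, []))
    return expanded


def idf_table(docs):
    """Return a function giving a smoothed inverse document frequency."""
    n = len(docs)
    df = Counter()
    for text in docs.values():
        df.update(set(tokenize(text)))
    return lambda term: math.log((n + 1) / (df[term] + 1)) + 1.0


def weigh(tokens, idf):
    """Build a sparse TF-IDF vector as a dict from term to weight."""
    return {t: c * idf(t) for t, c in Counter(tokens).items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(suspicious_text, docs, synonyms=None):
    """Rank documents as potential sources; expand the query if synonyms are given."""
    idf = idf_table(docs)
    query_tokens = tokenize(suspicious_text)
    if synonyms:
        query_tokens = expand_query(query_tokens, synonyms)
    query_vec = weigh(query_tokens, idf)
    scores = {
        doc_id: cosine(query_vec, weigh(tokenize(text), idf))
        for doc_id, text in docs.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    print("without expansion:", rank_candidates(SUSPICIOUS, DOCS))
    print("with expansion:   ", rank_candidates(SUSPICIOUS, DOCS, SYNONYMS))
```

In this toy setting the expanded query shares more terms with the source document whose vocabulary was replaced, so its similarity score rises relative to the unexpanded query. How much expansion helps on real MEDLINE data, and how expanded terms should be weighted, is exactly what the paper evaluates and is not captured here.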
