首页> 外国专利> Using computer-implemented analytics to determine plagiarism or heavy paraphrasing

Using computer-implemented analytics to determine plagiarism or heavy paraphrasing

机译:使用计算机实施的分析来确定抄袭或沉重的释义

摘要

Current plagiarism checkers can determine if a document has extensively copied from another but these checkers fail when a document is filled with synonyms. The present invention offers a means of detecting plagiarism involving the use of synonyms or heaving paraphrasing. A source passage from a pertinent corpus is modified by generating augmented tokens which include synonyms for the various terms in the passage. Text analysis may be required to confirm the semantic meaning of a term. The modified passage may be a regular expression. The modified passage can then be compared to a suspect passage using a trigram scorer. If possible plagiarism is detected, an alert message is sent to a reviewer with the passage of interest, the source passage, and an identification of the related corpus.
机译:当前的抄袭检查器可以确定文档是否已从另一个文档广泛复制,但是当文档填充有同义词时,这些检查器会失败。本发明提供了一种检测涉及使用同义词或起伏的抄袭的手段。通过生成增强令牌来修改来自相关语料库的源通道,该增强令牌包括在段落中的各种术语中包含的同义词。可能需要文本分析来确认术语的语义含义。修改的段落可以是正则表达式。然后可以使用Trigram Imorer将修改的通道进行比较。如果检测到可能的抄袭,则通过感兴趣的通过,源通道和相关语料库的识别,向审阅者发送警报消息。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号