Current plagiarism checkers can determine if a document has extensively copied from another but these checkers fail when a document is filled with synonyms. The present invention offers a means of detecting plagiarism involving the use of synonyms or heaving paraphrasing. A source passage from a pertinent corpus is modified by generating augmented tokens which include synonyms for the various terms in the passage. Text analysis may be required to confirm the semantic meaning of a term. The modified passage may be a regular expression. The modified passage can then be compared to a suspect passage using a trigram scorer. If possible plagiarism is detected, an alert message is sent to a reviewer with the passage of interest, the source passage, and an identification of the related corpus.
展开▼