A search engine is both the entry point for retrieving information on the web and a vast online information repository. This paper proposes an online algorithm for detecting copying in papers: the document under examination is split into segments with a fixed-length sliding window, and the exact-match retrieval function of a search engine is used to measure quantitatively the similarity between the paper and information on the web. Experiments show that the algorithm is practical and feasible.
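The abstract gives no implementation details, so the following is only a minimal sketch of the general idea under stated assumptions. The window is measured in characters, the step size is a free parameter, and the similarity score is taken to be the fraction of windows that return an exact-match hit. The names `sliding_windows`, `similarity`, and `make_local_search` are hypothetical, and the search engine is replaced by an in-memory stand-in so that the example runs offline.

```python
from typing import Callable, List


def sliding_windows(text: str, width: int, step: int = 1) -> List[str]:
    """Split text into fixed-width character windows, advancing by `step`."""
    if width <= 0 or step <= 0:
        raise ValueError("width and step must be positive")
    if len(text) < width:
        # Text shorter than one window: treat it as a single segment.
        return [text] if text else []
    return [text[i:i + width] for i in range(0, len(text) - width + 1, step)]


def similarity(
    text: str,
    exact_search: Callable[[str], bool],
    width: int = 8,
    step: int = 8,
) -> float:
    """Fraction of windows for which the exact-phrase search reports a hit.

    `exact_search` is an assumed interface: it takes a phrase and returns
    True if the phrase is found verbatim. A real deployment would wrap a
    search engine's exact-phrase query here.
    """
    windows = sliding_windows(text, width, step)
    if not windows:
        return 0.0
    hits = sum(1 for w in windows if exact_search(w))
    return hits / len(windows)


def make_local_search(corpus: List[str]) -> Callable[[str], bool]:
    """Stand-in for a search engine: exact substring match over a local corpus."""
    def search(phrase: str) -> bool:
        return any(phrase in doc for doc in corpus)
    return search


if __name__ == "__main__":
    search = make_local_search(["abcdefgh12345678"])
    print(similarity("abcdefghXXXXXXXX", search, width=8, step=8))
```

Passing the search function in as a parameter keeps the windowing and scoring logic independent of any particular search engine. A smaller step produces overlapping windows and a finer-grained score, at the cost of more queries.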