首页> 外文期刊>International journal on digital libraries >A document comparison scheme for secure duplicate detection
【24h】

A document comparison scheme for secure duplicate detection

机译:用于安全重复检测的文档比较方案

获取原文
获取原文并翻译 | 示例
       

摘要

The ever-growing volumes of textual information from various sources have fostered the development of digital libraries, making digital content readily accessible but also easy for malicious users to plagiarize, thus giving rise to security problems. In this paper, we introduce a duplicate detection scheme that is able to determine, with a particularly high accuracy, the degree to which one document is similar to another. Our pairwise document comparison scheme detects the resemblance between the content of documents by considering document chunks, representing contexts of words selected from the text. The resulting duplicate detection technique presents a good level of security in the protection of intellectual property while improving the availability of the data stored in the digital library and the correctness of the search results. Finally, the paper addresses efficiency and scalability issues by introducing new data reduction techniques.
机译:来自各种来源的文本信息的不断增长促进了数字图书馆的发展,使数字内容易于访问,也易于恶意用户users窃,从而引发了安全问题。在本文中,我们介绍了一种重复检测方案,该方案能够以极高的准确度确定一个文档与另一个文档的相似程度。我们的成对文档比较方案通过考虑文档块(代表从文本中选择的单词的上下文)来检测文档内容之间的相似性。所得的重复检测技术在保护知识产权的同时提供了良好的安全性,同时提高了数字图书馆中存储的数据的可用性和搜索结果的正确性。最后,本文通过介绍新的数据缩减技术解决了效率和可伸缩性问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号