Reliability of duplicate document detection algorithms
Abstract
In a single-signature duplicate document detection system, a secondary set of attributes is used in addition to a primary set of attributes to improve the precision of the system. When the projection of a document onto the primary set of attributes falls below a threshold, the secondary set of attributes supplements the primary lexicon so that the projection rises above the threshold.
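The fallback described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the projection is measured, how attributes are extracted, or how the signature is computed. The sketch assumes that attributes are lowercase word tokens, that the projection size is the number of distinct document tokens found in the lexicon, and that the signature is a SHA-1 hash of the sorted projection. The names `tokenize`, `project`, and `signature` are hypothetical.

```python
import hashlib
import re


def tokenize(text):
    """Assumed attribute extraction: distinct lowercase alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def project(tokens, lexicon):
    """Projection of a document onto a lexicon: the tokens both share."""
    return tokens & lexicon


def signature(text, primary, secondary, threshold):
    """Return a single signature for the document, or None if the
    projection stays below the threshold even with the secondary set.

    Assumption: the threshold is a minimum count of distinct tokens.
    """
    tokens = tokenize(text)
    projection = project(tokens, primary)

    # Primary projection too small: supplement it from the secondary set,
    # in sorted order so the result does not depend on token order.
    if len(projection) < threshold:
        for term in sorted(project(tokens, secondary) - projection):
            projection.add(term)
            if len(projection) >= threshold:
                break

    if len(projection) < threshold:
        return None

    # Assumed signature scheme: hash of the sorted projected tokens.
    joined = " ".join(sorted(projection))
    return hashlib.sha1(joined.encode("utf-8")).hexdigest()


# Hypothetical lexicons, for illustration only.
PRIMARY = {"patent", "signature", "lexicon"}
SECONDARY = {"document", "duplicate", "system"}

if __name__ == "__main__":
    a = signature("The duplicate document system", PRIMARY, SECONDARY, 3)
    b = signature("system DOCUMENT, duplicate the!", PRIMARY, SECONDARY, 3)
    print(a == b)
```

In this sketch, two documents with no primary-lexicon terms still receive matching signatures when their secondary-lexicon terms coincide, while a document that matches neither lexicon receives no signature and is never reported as a duplicate.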