首页>
外国专利>
DOCUMENT SEARCH DEVICE AND METHOD BASED ON JACCARD MODEL
DOCUMENT SEARCH DEVICE AND METHOD BASED ON JACCARD MODEL
展开▼
机译:基于Jaccard模型的文档搜索设备和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a similarity score normalization-based search apparatus and method, among embodiments, extracting a set of tokens including at least one token for each of at least one document, and hashes of N (where N is a natural number) By applying each function to the at least one token to generate N number of hash codes, a document index generator that generates at least one document index, a Jaccard model based on each of the at least one document index and a given A document similarity calculation unit that calculates and normalizes the similarity between documents, and a similarity ranking unit that determines a similarity ranking with the at least one document index based on the similarity level.
展开▼