首页>
外国专利>
DOCUMENT SEARCH DEVICE AND METHOD BASED ON JACCARD MODEL
DOCUMENT SEARCH DEVICE AND METHOD BASED ON JACCARD MODEL
展开▼
机译:基于Jaccard模型的文档搜索设备和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a search apparatus and method based on similarity score normalization, and among embodiments, extracting a token set including at least one token for each of at least one document and performing N hashes (where N is a natural number) A document index generator that generates at least one document index by applying each function to the at least one token to generate N hash codes, each of the at least one document index and given It may include a document similarity calculator that calculates and normalizes the similarity between documents, and a similarity rank determiner that determines a similarity rank with the at least one document index based on the similarity.
展开▼