首页>
外国专利>
MATCHING ENGINE WITH SIGNATURE GENERATION AND RELEVANCE DETECTION
MATCHING ENGINE WITH SIGNATURE GENERATION AND RELEVANCE DETECTION
展开▼
机译:具有签名生成和相关性检测的匹配引擎
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and a method generates at least one signature associated with document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokes is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
展开▼