Being the document similarity calculation device, the clustering device and the document
Abstract
PROBLEM TO BE SOLVED: To perform clustering and document extraction efficiently by computing, with high accuracy and independently of document size, a document similarity that can be used as an absolute value.
SOLUTION: This document similarity computing device is provided with an input part 11 for inputting a document set, and a normalization part 14. For each of a plurality of document combinations in the inputted document set, the normalization part 14 computes a similarity, usable only as a relative value between the documents, by a tf-idf method using document vectors and the importance of the words included in the documents, and then converts each similarity into an absolute value by normalization.
COPYRIGHT: (C)2003,JPO
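The abstract does not say which normalization the device applies, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes the raw (relative) similarity is the dot product of tf-idf document vectors and that normalization divides by the vectors' lengths (cosine normalization), which bounds every score to [0, 1] regardless of document size. The function names, the smoothed idf formula, and the tokenized-list input format are all hypothetical and are not taken from the patent.

```python
import math
from collections import Counter
from itertools import combinations


def tfidf_vectors(docs):
    """Build one sparse tf-idf vector (dict: word -> weight) per tokenized document."""
    n = len(docs)
    # Document frequency: in how many documents each word appears.
    df = Counter(word for doc in docs for word in set(doc))
    # Word importance as a smoothed idf; this exact formula is an assumption.
    idf = {w: math.log((1 + n) / (1 + c)) + 1.0 for w, c in df.items()}
    return [{w: tf * idf[w] for w, tf in Counter(doc).items()} for doc in docs]


def raw_similarity(u, v):
    """Unnormalized dot product; it grows with document length, so it is only relative."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(weight * v.get(word, 0.0) for word, weight in u.items())


def normalized_similarity(u, v):
    """Assumed normalization: divide by both vector lengths (cosine), giving [0, 1]."""
    denom = math.sqrt(raw_similarity(u, u) * raw_similarity(v, v))
    return raw_similarity(u, v) / denom if denom else 0.0


def pairwise_similarities(docs):
    """Normalized similarity for every pair of documents in the set."""
    vecs = tfidf_vectors(docs)
    return {
        (i, j): normalized_similarity(vecs[i], vecs[j])
        for i, j in combinations(range(len(vecs)), 2)
    }
```

Under these assumptions, a document and a copy of it repeated twice have different raw similarities to other documents but a normalized similarity of 1 to each other, which is the size independence the abstract describes. A fixed threshold on the normalized score could then drive clustering or document extraction.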