INTER-DOCUMENT SIMILARITY CALCULATION DEVICE, INTER-DOCUMENT SIMILARITY CALCULATION METHOD AND INTER-DOCUMENT SIMILARITY CALCULATION PROGRAM
Abstract
PROBLEM TO BE SOLVED: To provide an inter-document similarity calculation device that calculates the similarity between documents with high accuracy while keeping the computational burden from becoming excessive.
SOLUTION: A device 100 comprises: a unit (101) that, where N is the total number of characters in a sentence included in each of multiple documents, generates, for each integer i from 0 to N-1, suffix part information indicating the suffix part that remains after removing the first i characters of the sentence; a unit (102) that selects, from among the suffix parts, a suffix part generated from multiple sentences as a reference suffix part; a unit (103) that generates, for each of the multiple documents, similarity basic information indicating whether or not the document includes the reference suffix part; and a unit (104) that calculates the similarity between a first document and a second document based on the similarity basic information generated for the first document and the similarity basic information generated for the second document.
COPYRIGHT: (C)2012, JPO&INPIT
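The four units described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names are hypothetical, and three points that the abstract leaves open are filled with assumptions: a reference suffix is taken to be one produced by at least two distinct sentences, a document "includes" a reference suffix when one of its sentences ends with it, and the final score is a Jaccard ratio over the resulting true/false vectors.

```python
from collections import defaultdict


def suffixes(sentence):
    """Unit 101: the suffix left after dropping the first i characters, i = 0..N-1."""
    return [sentence[i:] for i in range(len(sentence))]


def reference_suffixes(documents):
    """Unit 102: keep suffixes produced by two or more distinct sentences (assumed rule)."""
    sources = defaultdict(set)
    for doc_id, sentences in enumerate(documents):
        for sent_id, sentence in enumerate(sentences):
            for suffix in suffixes(sentence):
                sources[suffix].add((doc_id, sent_id))
    return sorted(s for s, origin in sources.items() if len(origin) >= 2)


def basic_info(sentences, refs):
    """Unit 103: one boolean per reference suffix, True if the document contains it."""
    own = {suffix for sentence in sentences for suffix in suffixes(sentence)}
    return [ref in own for ref in refs]


def similarity(info_a, info_b):
    """Unit 104: Jaccard ratio of two boolean vectors (assumed measure)."""
    both = sum(a and b for a, b in zip(info_a, info_b))
    either = sum(a or b for a, b in zip(info_a, info_b))
    return both / either if either else 0.0


# Each document is a list of sentences (toy data, not from the patent).
documents = [["abc", "xbc"], ["bc"], ["xyz"]]
refs = reference_suffixes(documents)
infos = [basic_info(doc, refs) for doc in documents]
print(refs)
print(similarity(infos[0], infos[1]), similarity(infos[0], infos[2]))
```

Because every document is reduced to a fixed-length boolean vector over the shared reference suffixes, comparing any pair of documents costs time proportional to the number of reference suffixes rather than to the document lengths, which is one plausible reading of how the device limits the computational burden.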