首页> 外国专利> APPARATUS FOR ANALYZING SET OF DOCUMENTS, METHOD FOR ANALYZING SET OF DOCUMENTS, PROGRAM IMPLEMENTING THIS METHOD, AND RECORDING MEDIUM STORING THIS PROGRAM

APPARATUS FOR ANALYZING SET OF DOCUMENTS, METHOD FOR ANALYZING SET OF DOCUMENTS, PROGRAM IMPLEMENTING THIS METHOD, AND RECORDING MEDIUM STORING THIS PROGRAM

机译:用于分析文件集的设备,用于分析文件集的方法,用于实施该方法的程序以及用于对该程序进行记录的介质

摘要

PROBLEM TO BE SOLVED: To allow an apparatus for analyzing a set of documents to determine that even documents issued on different dates in terms of time are highly related to each other if their contents are highly correlated to each other.;SOLUTION: The apparatus for analyzing a set of documents includes a document set specifying part 10 for specifying a set of documents according to specification requirements; a content similarity evaluating part 20 for evaluating a similarity in content among the documents included in the set of documents; a time stamp similarity evaluating part 30 for evaluating a similarity in time among the documents included in the set of documents; a relationship extracting part 40 for extracting a relationship among the documents based on the similarities in content and in time; a centricity determining part 50 for calculating the centricity of the documents based on the relationship among the documents; an information analyzing part 100 for specifying a theme word contained in the set of documents, a set of documents related to the theme word, and the role of documents in the set of documents from the entire set of documents based on the relationship among the documents and the centricity of the individual documents obtained; and an information output part 110 for visualizing and outputting the specified set of documents.;COPYRIGHT: (C)2008,JPO&INPIT
机译:要解决的问题:使一种用于分析一组文档的设备能够确定,即使在时间上在不同日期发布的文档,如果它们的内容彼此之间具有高度相关性,则它们彼此之间也具有高度相关性。分析文档集合包括文档集合指定部分10,用于根据规格要求指定文档集合;内容相似度评估部分20,用于评估在该组文档中包括的文档之间的内容相似度;时间戳相似度评估部分30,用于评估包括在文档集合中的文档之间的时间相似度;关系提取部分40,用于根据内容和时间上的相似性提取文档之间的关系;中心度确定部分50,用于基于文档之间的关系来计算文档的中心度;信息分析部分100,用于基于文档之间的关系来指定文档集合中包含的主题词,与该主题词相关的文档集合以及文档在该文档集合中的角色以及获得的单个文件的中心性;信息输出部分110,用于可视化和输出指定的一组文档。COPYRIGHT:(C)2008,JPO&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号