
DEVICE AND PROGRAM FOR DISCRIMINATING DOCUMENTS WITH THE SAME NAME


Abstract

PROBLEM TO BE SOLVED: To return retrieval results only for the specific person the user wants, by separating documents about different persons who share the same name when information on a person is retrieved.

SOLUTION: The invention accumulates documents retrieved on the basis of a person's name in a document storage means, reads the documents from the document storage means to extract feature vectors, and stores the feature vectors in a document vector storage means. It then reads the feature vectors from the document vector storage means, clusters them to generate cluster groups of similar vectors, and stores the cluster groups in a cluster storage means. Finally, for the vectors included in each cluster read from the cluster storage means, it acquires the corresponding documents from the document storage means to generate a cluster of documents, and outputs the cluster of documents.

COPYRIGHT: (C)2009,JPO&INPIT
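The flow described in the abstract (store documents, extract feature vectors, cluster the vectors, map clusters back to documents) can be illustrated with a minimal sketch. The abstract does not say which features or which clustering algorithm the invention uses, so everything specific here is an assumption made for illustration: term-frequency vectors as the features, cosine similarity as the similarity measure, single-link merging with a fixed `threshold`, and the hypothetical function names `extract_feature_vector`, `cluster_vectors`, and `discriminate_same_name`.

```python
import math
import re
from collections import Counter


def extract_feature_vector(text):
    """Term-frequency vector (assumed feature; the abstract does not specify one)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster_vectors(vectors, threshold=0.5):
    """Single-link clustering (assumed algorithm): merge two clusters when any
    pair of their members has similarity >= threshold. Returns lists of ids."""
    clusters = [[doc_id] for doc_id in vectors]
    merged = True
    while merged:
        merged = False
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                if any(cosine(vectors[a], vectors[b]) >= threshold
                       for a in clusters[i] for b in clusters[j]):
                    clusters[i].extend(clusters.pop(j))
                    merged = True
                    break
            if merged:
                break
    return clusters


def discriminate_same_name(documents, threshold=0.5):
    """Run the pipeline; plain dicts and lists stand in for the storage means."""
    document_store = dict(documents)                       # document storage
    vector_store = {doc_id: extract_feature_vector(text)   # document vector storage
                    for doc_id, text in document_store.items()}
    cluster_store = cluster_vectors(vector_store, threshold)  # cluster storage
    # Map each vector cluster back to the documents it came from.
    return [[document_store[doc_id] for doc_id in cluster]
            for cluster in cluster_store]


if __name__ == "__main__":
    docs = {
        "d1": "John Smith physics professor quantum research",
        "d2": "John Smith quantum physics lecture research",
        "d3": "John Smith football striker scored goal",
        "d4": "John Smith football goal season striker",
    }
    for group in discriminate_same_name(docs):
        print(group)
```

Because every retrieved document contains the queried name, the name itself contributes similarity between all pairs; in this sketch the threshold has to sit above that baseline for different persons to land in different clusters.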
