首页> 外国专利> METHOD AND DEVICE FOR FORMING RETRIEVAL INDEX BY COMMUNITY EXTRACTION

METHOD AND DEVICE FOR FORMING RETRIEVAL INDEX BY COMMUNITY EXTRACTION

机译:通过社区提取形成检索索引的方法和设备

摘要

PROBLEM TO BE SOLVED: To form a retrieval index appropriate to general Web pages as well as blogs.;SOLUTION: The method comprises steps of: grouping queries having a strong correlation from a query log for clustering; labeling term groups of clusters; forming a route set based on the term of each cluster; forming a base set based on each node contained in the route set; performing community extraction based on a unique vector corresponding to a maximum unique value and a unique vector corresponding to a unique value other than the maximum unique value from the base set by HITS algorithm; extracting phrase from the authority and hub of the community extraction resu forming a directory based on the cluster, label, term and phrase; and forming a retrieval index based on the directory.;COPYRIGHT: (C)2008,JPO&INPIT
机译:解决的问题:形成适合于一般网页和博客的检索索引。解决方案:该方法包括以下步骤:将来自查询日志的具有强相关性的查询分组以进行聚类;标记集群的术语组;根据每个聚类的项形成一个路由集;基于路由集合中包含的每个节点形成基础集合;基于对应于最大唯一值的唯一向量和对应于除HITS算法设置的基数中的最大唯一值以外的唯一值的唯一向量进行社区提取;从社区提取结果的权威和中心中提取短语;根据集群,标签,术语和短语形成目录; ;并基于该目录形成检索索引。; COPYRIGHT:(C)2008,JPO&INPIT

著录项

  • 公开/公告号JP2008191877A

    专利类型

  • 公开/公告日2008-08-21

    原文格式PDF

  • 申请/专利权人 YAHOO JAPAN CORP;

    申请/专利号JP20070024761

  • 发明设计人 WANG SHAO-CHI;

    申请日2007-02-02

  • 分类号G06F17/30;

  • 国家 JP

  • 入库时间 2022-08-21 20:25:02

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号