首页> 外国专利> METHOD AND SYSTEM FOR SUBJECT RELEVANT WEB PAGE FILTERING BASED ON NAVIGATION PATHS INFORMATION

METHOD AND SYSTEM FOR SUBJECT RELEVANT WEB PAGE FILTERING BASED ON NAVIGATION PATHS INFORMATION

机译:基于导航路径信息的主题相关网页过滤方法和系统

摘要

Method and system to utilize the set of navigation paths of web pages as the contextual information for subject relevant web page filtering with high accuracy are provided. The method comprises the steps of: obtaining all web pages in one or more web pages collections; collecting information of the links among the obtained web pages; extracting, based on the collected links, a set of navigation paths of each of the obtained web pages; and filtering the obtained web pages based on the extracted set of navigation paths to obtain desired web pages. In some embodiments, the extraction of the navigation paths is preferably performed on the navigation links of the web pages. Therefore, the method also comprises the process for deleting non-navigation links from all the links of the web pages. Compared with the prior art, the present invention can utilize the contextual information of the web pages for web page filtering in a more sufficient way, thereby improving the accuracy and objectivity of the web page filtering.
机译:提供了一种方法和系统,该方法和系统利用网页的导航路径的集合作为上下文信息,以高精度地进行主题相关的网页过滤。该方法包括以下步骤:获得一个或多个网页集合中的所有网页;以及收集获取的网页之间的链接信息;根据收集到的链接,提取每个获取的网页的导航路径;基于提取的导航路径集合过滤获取的网页,以获取期望的网页。在一些实施例中,导航路径的提取优选地在网页的导航链接上执行。因此,该方法还包括用于从网页的所有链接中删除非导航链接的过程。与现有技术相比,本发明可以更充分地利用网页的上下文信息进行网页过滤,从而提高了网页过滤的准确性和客观性。

著录项

  • 公开/公告号US2009083244A1

    专利类型

  • 公开/公告日2009-03-26

    原文格式PDF

  • 申请/专利权人 JIANQIANG LI;YU ZHAO;

    申请/专利号US20080236166

  • 发明设计人 YU ZHAO;JIANQIANG LI;

    申请日2008-09-23

  • 分类号G06F7/06;G06F17/00;G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 19:33:21

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号