Method of web crawling utilizing crawl numbers

Abstract

A computer-based system and method for retrieving information about electronic documents on a computer network is disclosed. The method includes maintaining a database that associates each electronic document with a corresponding crawl number indicating the most recent crawl during which a change to the document was detected. During a subsequent crawl, electronic documents that have changed since the previous crawl are retrieved, and selected data is stored in the database. The retrieved document information is marked with a crawl number. During subsequent searches, crawl numbers are used to identify the documents that have changed since a specified crawl.
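The bookkeeping described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `CrawlIndex`, `DocRecord`, and `changed_since` are hypothetical, the "database" is an in-memory dictionary, and change detection by content hash is an assumption made for illustration (the abstract does not say how changes are detected).

```python
import hashlib
from dataclasses import dataclass


@dataclass
class DocRecord:
    content_hash: str   # fingerprint of the content last stored (assumed mechanism)
    crawl_number: int   # most recent crawl during which a change was detected


class CrawlIndex:
    """Hypothetical in-memory stand-in for the database in the abstract."""

    def __init__(self):
        self.records = {}        # url -> DocRecord
        self.current_crawl = 0   # number of the most recent crawl

    def crawl(self, fetch, urls):
        """Run one crawl over `urls`; return the URLs found new or changed."""
        self.current_crawl += 1
        changed = []
        for url in urls:
            digest = hashlib.sha256(fetch(url).encode("utf-8")).hexdigest()
            record = self.records.get(url)
            if record is None or record.content_hash != digest:
                # Mark the stored document information with this crawl's number.
                self.records[url] = DocRecord(digest, self.current_crawl)
                changed.append(url)
        return changed

    def changed_since(self, crawl_number):
        """URLs whose last detected change came after the given crawl."""
        return sorted(
            url for url, record in self.records.items()
            if record.crawl_number > crawl_number
        )


# Usage: a dictionary plays the role of the network being crawled.
pages = {"a": "v1", "b": "v1"}
index = CrawlIndex()
index.crawl(pages.get, list(pages))   # crawl 1: both documents are new
pages["b"] = "v2"
index.crawl(pages.get, list(pages))   # crawl 2: only "b" has changed
print(index.changed_since(1))         # ['b']
```

Because each record keeps only the number of the last crawl that saw a change, a query for "everything changed since crawl N" becomes a simple comparison and needs no per-crawl history.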

Bibliographic details

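  • Publication number: US6638314B1

    Patent type:

  • Publication date: 2003-10-28

    Original format: PDF

  • Applicant/assignee: MICROSOFT CORPORATION

    Application number: US19980105758

  • Inventors: SANKRANT SANU; DMITRIY MEYERZON

    Filing date: 1998-06-26

  • Classification: G06F70/20

  • Country: US

  • Date added to database: 2022-08-22 00:05:38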
