Conference: International Conference on Intelligent Computing and Integrated Systems

Research and design of the crawler system in a vertical search engine

Abstract

The crawler system in a vertical search engine should first format a representative sample web page so that it conforms to W3C standards. The processed page can then be parsed by the visual XPath generator, which yields the desired XPath values. During batch data extraction, the crawler system parses the target web pages and extracts the exact data required. A vertical search engine first extracts the necessary data and segments Chinese words, and then presents the data on web pages. The data structuring process that follows data extraction distinguishes a vertical search engine from a traditional search engine. The crawler system, which extracts specialized information from the Internet and performs preliminary processing on it, is an indispensable part of a vertical search engine.
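The extraction step described in the abstract can be illustrated with a minimal sketch. It assumes the page has already been normalized to well-formed XHTML (the formatting step), and that XPath expressions have already been obtained from a sample page. The sample markup, the XPath expressions, and the function names `extract_records` and `extract_batch` are all hypothetical and are not taken from the paper. The sketch uses Python's standard `xml.etree.ElementTree`, which supports only a limited subset of XPath; a real crawler would likely need a full XPath engine.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample page, assumed to be already normalized to
# well-formed XHTML by the formatting step.
SAMPLE_PAGE = """<html>
  <body>
    <div class="item">
      <h2 class="title">Widget A</h2>
      <span class="price">19.90</span>
    </div>
    <div class="item">
      <h2 class="title">Widget B</h2>
      <span class="price">5.00</span>
    </div>
  </body>
</html>"""

# Hypothetical XPath expressions, standing in for the values that a
# visual XPath generator might produce from the sample page.
ITEM_XPATH = ".//div[@class='item']"
FIELD_XPATHS = {
    "title": "./h2[@class='title']",
    "price": "./span[@class='price']",
}


def extract_records(xhtml, item_xpath, field_xpaths):
    """Turn one well-formed page into a list of structured records."""
    root = ET.fromstring(xhtml)
    records = []
    for item in root.findall(item_xpath):
        record = {}
        for name, xpath in field_xpaths.items():
            node = item.find(xpath)
            # Store None when a field is missing or has no text.
            has_text = node is not None and node.text
            record[name] = node.text.strip() if has_text else None
        records.append(record)
    return records


def extract_batch(pages, item_xpath, field_xpaths):
    """Apply the same XPath expressions to every page in a batch."""
    results = []
    for page in pages:
        results.extend(extract_records(page, item_xpath, field_xpaths))
    return results


if __name__ == "__main__":
    for row in extract_batch([SAMPLE_PAGE], ITEM_XPATH, FIELD_XPATHS):
        print(row)
```

Because the expressions are derived once from a representative page and then reused across the batch, every page that shares the same template yields records with the same fields, which is the structuring step the abstract attributes to vertical search engines.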