【24h】

Focused Crawling Using Temporal Difference-Learning

机译:使用时间差异学习进行集中爬行

获取原文
获取原文并翻译 | 示例

摘要

This paper deals with the problem of constructing an intelligent Focused Crawler, i.e. a system that is able to retrieve documents of a specific topic from the Web. The crawler must contain a component which assigns visiting priorities to the links, by estimating the probability of leading to a relevant page in the future. Reinforcement Learning was chosen as a method that fits this task nicely, as it provides a method for rewarding intermediate states to the goal. Initial results show that a crawler trained with Reinforcement Learning is able to retrieve relevant documents after a small number of steps.
机译:本文讨论了构建智能的“集中抓取工具”的问题,即一个能够从Web检索特定主题的文档的系统。搜寻器必须包含一个组件,该组件通过估计将来导致相关页面访问的可能性来为链接分配访问优先级。选择强化学习作为一种非常适合此任务的方法,因为它提供了一种奖励中间状态达到目标的方法。初步结果表明,经过强化学习培训的爬虫能够在少量步骤后检索相关文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号