【24h】

Learning to Surface Deep Web Content

机译:学习显示深层Web内容

获取原文

摘要

We propose a novel deep web crawling framework based on reinforcement learning. The crawler is regarded as an agent and deep web database as the environment. The agent perceives its current state and submits a selected action (query) to the environment according to Q-value. Based on the framework we develop an adaptive crawling method. Experimental results show that it outperforms the state of art methods in crawling capability and breaks through the assumption of full-text search implied by existing methods.
机译:我们提出了一种基于强化学习的新颖的深层网络爬网框架。搜寻器被视为代理,深度Web数据库被视为环境。代理感知其当前状态,并根据Q值向环境提交选定的操作(查询)。基于该框架,我们开发了一种自适应爬网方法。实验结果表明,它在抓取能力方面优于现有方法,并突破了现有方法所隐含的全文搜索的假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号