Focused Crawling Using Temporal Difference-Learning

机译：使用时间差异学习进行集中爬行

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper deals with the problem of constructing an intelligent Focused Crawler, i.e. a system that is able to retrieve documents of a specific topic from the Web. The crawler must contain a component which assigns visiting priorities to the links, by estimating the probability of leading to a relevant page in the future. Reinforcement Learning was chosen as a method that fits this task nicely, as it provides a method for rewarding intermediate states to the goal. Initial results show that a crawler trained with Reinforcement Learning is able to retrieve relevant documents after a small number of steps.

机译：本文讨论了构建智能的“集中抓取工具”的问题，即一个能够从Web检索特定主题的文档的系统。搜寻器必须包含一个组件，该组件通过估计将来导致相关页面访问的可能性来为链接分配访问优先级。选择强化学习作为一种非常适合此任务的方法，因为它提供了一种奖励中间状态达到目标的方法。初步结果表明，经过强化学习培训的爬虫能够在少量步骤后检索相关文档。

著录项

来源
《Hellenic Conference on AI(Artificial Intellignece)(SENTN 2004); 20040505-20040508; Samos; GR》|2004年|P.142-153|共12页
会议地点 Samos(GR);Samos(GR)
作者
Alexandros Grigoriadis; Georgios Paliouras;
展开▼
作者单位

Software and Knowledge Engineering Laboratory Institute of Informatics and Telecommunications, National Centre for Scientific Research "Demokritos" 153 10 Ag. Paraskevi, Athens, Greece;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词
machine learning; reinforcement learning; web mining; focused crawling;

机译：机器学习;强化学习; Web挖掘;集中爬网;

相似文献

外文文献
中文文献
专利

1. Keyword weight optimization using gradient strategies in event focused web crawling [J] . Rajiv S., Navaneethan C. Pattern recognition letters . 2021,第Feba期

机译：关键词权重优化在活动中使用渐变策略的重点策略
2. Emotional attitudes towards procrastination in people: A large-scale sentiment-focused crawling analysis [J] . Chen Zhiyi, Zhang Rong, Xu Ting, Computers in Human Behavior . 2020,第Sepa期

机译：对人们拖延的情感态度：一个大规模的情绪集中爬行分析
3. FOCUSED WEB CRAWLING FOR HIGH PERFORMANCE SEARCH ENGINES: ISSUES, TECHNIQUES AND SYSTEMS [J] . SUSHIL KUMAR, NARESH CHAUHAN International journal of computational intelligence theory and practice . 2020,第1期

机译：专注于高性能搜索引擎的Web爬网：问题，技术和系统
4. Focused Crawling Using Temporal Difference-Learning [C] . Alexandros Grigoriadis, Georgios Paliouras Hellenic Conference on AI . 2004

机译：使用时间差异学习的重点爬行
5. A novel hybrid focused crawling algorithm to build domain-specific collections. [D] . Chen, Yuxin. 2007

机译：一种新颖的混合重点爬网算法，用于构建特定于域的集合。
6. Domain adaptation of statistical machine translation with domain-focused web crawling [O] . Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, -1

机译：统计机器翻译的领域适应和以领域为中心的网络爬网
7. Focused Crawling using Temporal Difference-Learning [O] . Ros Grigoriadis, Georgios Paliouras 2008

机译：使用时间差异学习进行集中爬行
8. Focused Crawling of the Deep Web Using Service Class Descriptions [R] . Rocco, D., Liu, L., Critchlow, T. 2005

机译：使用服务类描述重点对Deep Web进行爬网

Focused Crawling Using Temporal Difference-Learning

摘要

著录项

相似文献

相关主题

期刊订阅