A Learning-Based Approach for Fetching Pages in WebVigiL

Abstract

The World Wide Web is an omnipresent and ever-expanding source of data. Data on the web is constantly increasing and changing. Users are often interested in specific changes to that data. Currently, to detect changes of interest, users have to poll the pages periodically and check for those changes themselves. WebVigiL is a general-purpose information monitoring and notification system. It handles the specification, intelligent fetching, detection, and propagation of changes as requested by a user, while meeting quality-of-service requirements. We use active capability in the form of event-condition-action (ECA) rules, together with a combination of the push and pull paradigms, for change monitoring. In this paper, we present an overview of the specification language and the run-time management of sentinels. We discuss in detail the use of ECA rules for fetching and the adaptive learning algorithm used for fetching pages. We conclude with the implementation status of WebVigiL.
