Conference: ASIST Annual Meeting

An Architecture for SCS: A Specialized Web Crawler on the Topic of Security

Abstract

Mining the World Wide Web for correct and relevant information is a difficult task handled by Web crawlers. This study outlines the components of a specialized crawler on the topic of security (SCS), which makes heavy use of artificial neural networks and rule-based expert systems to achieve focused crawling on that topic. SCS is designed to find and index Web pages of interest and to track their updates, and it proposes new approaches for reaching relevant pages that might stay hidden from other crawling approaches. SCS consists of four new page explorers, a database of relevant pages, a relevance evaluator using artificial neural networks, and an updater using rule-based expert systems. SCS is a multi-threaded, multi-object combination of a Java applet and application with embedded SQL and PHP elements; its expandable, modular structure allows it to run on a single machine or, through parallel processing, on multiple machines.
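
The component list in the abstract can be illustrated with a minimal sketch of a focused crawl loop. This is not the SCS implementation (which the abstract describes as Java with embedded SQL and PHP); it is a Python illustration under loud assumptions: `TOY_WEB`, `WEIGHTS`, `BIAS`, the threshold, and all function names are hypothetical, a single logistic unit stands in for the artificial neural network, a plain dictionary stands in for the database, and three hand-written rules stand in for the rule-based updater.

```python
import heapq
import math

# Hypothetical in-memory "web": url -> (page text, outgoing links).
# Invented for illustration; nothing here comes from the paper.
TOY_WEB = {
    "a": ("firewall intrusion detection overview", ["b", "c"]),
    "b": ("recipes for apple pie", ["d"]),
    "c": ("encryption and malware analysis", ["d"]),
    "d": ("gardening tips", []),
}

# Assumed keyword weights for one logistic unit, a stand-in for a trained ANN.
WEIGHTS = {"firewall": 1.5, "intrusion": 1.5, "encryption": 1.5, "malware": 1.5}
BIAS = -2.0


def relevance(text):
    """Score text in (0, 1) with a single logistic neuron over keyword hits."""
    words = text.lower().split()
    z = BIAS + sum(WEIGHTS.get(w, 0.0) for w in words)
    return 1.0 / (1.0 + math.exp(-z))


def recheck_interval_days(changes_seen, visits):
    """Toy rule base for an updater: pages that change often are revisited sooner."""
    if visits == 0:
        return 1  # never visited: check again tomorrow
    ratio = changes_seen / visits
    if ratio > 0.5:
        return 1
    if ratio > 0.1:
        return 7
    return 30


def crawl(seeds, threshold=0.5):
    """Best-first crawl that stores and expands only pages judged relevant."""
    frontier = [(-1.0, url) for url in seeds]  # max-heap via negated priority
    heapq.heapify(frontier)
    seen = set(seeds)
    database = {}  # url -> relevance score of each stored page
    while frontier:
        _, url = heapq.heappop(frontier)
        text, links = TOY_WEB.get(url, ("", []))
        score = relevance(text)
        if score < threshold:
            continue  # irrelevant page: neither stored nor expanded
        database[url] = score
        for link in links:
            if link not in seen:
                seen.add(link)
                # A link inherits the score of the page that pointed to it.
                heapq.heappush(frontier, (-score, link))
    return database


if __name__ == "__main__":
    print(sorted(crawl(["a"])))
    print(recheck_interval_days(changes_seen=3, visits=4))
```

Because this sketch expands links only from pages it scores as relevant, it cannot reach a relevant page that is linked solely from irrelevant ones. That is the kind of hidden page the abstract says SCS aims to reach with its explorers, whose actual strategies are not described in the abstract and are not modeled here.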
