首页> 外文会议>IEEE Symposium on Computer-Based Medical Systems >Web page downloading and classification
【24h】

Web page downloading and classification

机译:网页下载和分类

获取原文

摘要

This paper describes the processes of downloading and classifying Web-based articles in online medical journals as a preliminary step to extracting bibliographic data to populate MEDLINE the widely used database of the National Library of Medicine (NLM). The processes are combined to develop an automated system named "Web Page Downloading and Classification". The system downloads the Web pages using Microsoft's Windows Internet API tool called WinInet, and a combination of several Artificial Intelligence (AI) techniques including the Breadth-First search algorithm and the Constraint Satisfaction method. The Breadth-First search algorithm and the Constraint Satisfaction method are then used to traverse the Web page's links, identify these pages as abstract, full text, PDF or image files, recognize and generate the successors of the downloading pages.
机译:本文介绍了在线医学期刊下载和分类基于网络的文章作为提取书目数据的初步步骤,以填充MEDLINES国家医学图书馆的广泛使用的数据库(NLM)。该过程组合为开发名为“网页下载和分类”的自动化系统。系统使用Microsoft的Windows Internet API工具下载了名为WinInet的网页,以及多种人工智能(AI)技术的组合,包括广度 - 第一搜索算法和约束满意度方法。然后,广度 - 首先搜索算法和约束满意度方法遍历网页的链接,将这些页面标识为抽象,全文,PDF或图像文件,识别并生成下载页面的后续销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号