首页> 外文会议>National conference on integrated online library systems >Automating Data Entry for Online Biomedical Databases
【24h】

Automating Data Entry for Online Biomedical Databases

机译:自动化在线生物医学数据库的数据条目

获取原文

摘要

The Lister Hill National Center for Biomedical Communications, an R&D division of the National Library of Medicine (NLM), is engaged in developing systems for automating the extraction of information from biomedical journals to create bibliographic records in MEDLINE~R, NLM's premier online database used worldwide. The first phase of this project has resulted in a system that involves scanning and converting (by OCR) the abstracts that appear in journal articles, while keyboarding the remaining fields. A second generation system is being designed to automate the entry of other fields such as author names, institutional affiliations, page numbers, article titles and others. This system will employ scanning and OCR as well as document image analysis techniques that will automatically zone the scanned pages, identify the zones as particular fields, and reformat the field syntax to adhere to conventional practice in MEDLINE. This paper describes the first generation system currently used for production, and the work toward the design of the second generation system.
机译:Lister Hill国家生物医学通信中心,国家医学图书馆的R&D司(NLM),从事发展系统,以自动提取生物医学期刊的信息,以创建Medline〜R,NLM首屈一指的在线数据库中的书目记录全世界。该项目的第一阶段导致了一个系统,涉及扫描和转换(通过OCR)期刊文章中出现的摘要,同时键盘剩余字段。第二代系统正在旨在自动进入其他字段,例如作者名称,机构附属关系,页码,文章标题等。该系统将采用扫描和OCR以及文件图像分析技术,这些技术将自动区域扫描页面,识别特定字段的区域,并重新格式化以遵守Medline的传统实践。本文介绍了目前用于生产的第一代系统,以及对第二代系统设计的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号