【24h】

Virtual Library of' Multilingual Multicontents Scientific Journals

机译:多语种多内容科学期刊虚拟图书馆

获取原文
获取原文并翻译 | 示例

摘要

The work presented in this paper is tackling the document analysis and recognition problem, in line with a virtual library project. It presents the design and the implementation of a software platform allowing to place at the disposal of end users of a virtual library, automatic processing to display and retrieve, remotely and locally, dematerialized documents from databases made up of multilingual and multicontents scientific journals. The prototype was called BVMuLS. After a chain of digitalization, conversion and compression, the scanned documents are put at the format DjVu and the articles are safeguarded with format PDF. The whole is then stored in a MySQL database. The outline journal is used as index. The contents will be presented at the user as an XML document, allowing a lot of services: consultation, printing and safeguard locally. With this intention, we initially start with the segmentation of the image of the contents in order to extract the area where the text of the contents is. Once the area recognized, we apply a tagging technique so to find the various identifying fields of the articles, namely : the title, the author, the translator and the number of pages. In order to generate the XML file corresponding to this contents, other metadata, such as the name of the review, the volume of the review, the date of publication, etc, are added with those referring to the various articles published in the review.
机译:本文提出的工作是根据虚拟图书馆项目来解决文档分析和识别问题。它介绍了一个软件平台的设计和实现,该软件平台允许虚拟图书馆的最终用户使用,可以自动处理以显示和检索本地和非本地化的,由多种语言和多种内容的科学期刊组成的数据库中的非物化文档。原型称为BVMuLS。经过一连串的数字化,转换和压缩后,扫描的文档将以DjVu格式放置,并且文章将使用PDF格式进行保护。然后将整个存储在MySQL数据库中。大纲日记帐用作索引。内容将以XML文档的形式呈现给用户,从而提供许多服务:咨询,打印和本地维护。出于这个目的,我们首先从内容图像的分割开始,以提取内容文本所在的区域。一旦识别出该区域,我们将应用标签技术来查找文章的各种标识字段,即:标题,作者,翻译者和页数。为了生成与该内容相对应的XML文件,将其他元数据(例如,评论的名称,评论的数量,出版日期等)与引用该评论中发表的各种文章的内容一起添加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号