首页> 外文会议>Archiving 2013 >HADARA - A Software System for Semi-Automatic Processing of Historical Handwritten Arabic Documents
【24h】

HADARA - A Software System for Semi-Automatic Processing of Historical Handwritten Arabic Documents

机译:HADARA-半自动处理历史阿拉伯手写文件的软件系统

获取原文
获取原文并翻译 | 示例

摘要

Recently, many big libraries all over the world have been scanning their collections to make them publicly available and to preserve historical documents. We present a modular software system which can be used as a tool for semi-automatical processing of historical handwritten Arabic documents. The development of this system is part of the HADARA project which aims for historical document analysis of Arabic manuscripts and consists of a project team including engineers and computer scientists but also users such as linguists and historians. The HADARA system is designed to support script and content analysis, identification, and classification of historical Arabic documents. The system has been created following an iterative development approach, and the current version assists the user in an interactive and partially already in an automatic manner. In this paper, a system overview is given and the first modules are presented which support the annotation of a scanned manuscript in a semi-automatic manner. They comprise page layout analysis, text line segmentation, and transcription. Word spotting is the first application implemented in the HADARA system and its concept is outlined in this paper.
机译:最近,世界上许多大型图书馆都在扫描其馆藏,以使其可以公开获取并保存历史文献。我们提出了一种模块化软件系统,可以用作半自动处理历史手写阿拉伯文档的工具。该系统的开发是HADARA项目的一部分,该项目旨在对阿拉伯手稿进行历史文献分析,并由一个项目团队组成,该团队包括工程师和计算机科学家,还包括语言学家和历史学家等用户。 HADARA系统旨在支持脚本和内容分析,历史阿拉伯文件的识别和分类。该系统是按照迭代开发方法创建的,当前版本以交互方式(部分已经以自动方式)帮助用户。在本文中,给出了系统概述,并介绍了第一个模块,这些模块以半自动方式支持扫描手稿的注释。它们包括页面布局分析,文本行分割和转录。单词发现是在HADARA系统中实现的第一个应用程序,本文概述了它的概念。

著录项

  • 来源
    《Archiving 2013》|2013年|161-166|共6页
  • 会议地点 Washington DC(US)
  • 作者单位

    Institute for Communications Technology, Technische Universitaet Braunschweig, Braunschweig, Germany;

    Institute for Communications Technology, Technische Universitaet Braunschweig, Braunschweig, Germany;

    Institute for Communications Technology, Technische Universitaet Braunschweig, Braunschweig, Germany;

    Institute for Communications Technology, Technische Universitaet Braunschweig, Braunschweig, Germany;

    Ben-Gurion University of the Negev, Be'er-Sheva, Israel;

    Ben-Gurion University of the Negev, Be'er-Sheva, Israel;

    Ben-Gurion University of the Negev, Be'er-Sheva, Israel;

    Faculty of Engineering, Tel-Aviv University and Triangle RD Center, Kafr Qara, Israel;

    Triangle RD Center, Kafr Qara, Israel;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-26 14:07:44

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号