首页> 外文会议>12th International Conference on Frontiers in Handwriting Recognition >Semi-automatic Annotation Tool for Medieval Manuscripts
【24h】

Semi-automatic Annotation Tool for Medieval Manuscripts

机译:中世纪手稿的半自动注释工具

获取原文

摘要

Medieval manuscript layouts are quite complex. They contain textual elements such as insertions, annotations, and corrections. They may be richly decorated with ornaments, illustrations, and decorative initials making their layout even more complex. In this paper we describe a semi-automatic tool which annotates medieval manuscripts using our generic format. This format allows to represent the physical structure of such manuscripts. Our semi-automatic tool is composed of two parts. The first part achieves a layout analysis which automatically segments manuscripts into text blocks and text lines. That is, a Multi-Layer Perceptron (MLP) identifies layout elements by using color features, it extracts the textual content image of the manuscript. Then, a segmentation based on Connected Component (CC) is performed on the textual content in order to retrieve text blocks and lines. The second part provides an interactive interface allowing the user to customize the automatic analysis, to visualize its results, and to correct them. Our tool is still a prototype, nevertheless, first experiments give encouraging results. Thus, in this paper, we show how to generate a ground truth for medieval manuscripts layouts.
机译:中世纪手稿的布局非常复杂。它们包含文本元素,例如插入,注释和更正。它们可能装饰有精美的装饰品,插图和装饰缩写,从而使其布局更加复杂。在本文中,我们描述了一种半自动工具,该工具使用我们的通用格式注释中世纪手稿。这种格式可以表示此类手稿的物理结构。我们的半自动工具由两部分组成。第一部分实现了布局分析,该分析自动将手稿分成文本块和文本行。也就是说,多层感知器(MLP)通过使用颜色特征来识别布局元素,并提取手稿的文本内容图像。然后,对文本内容执行基于Connected Component(CC)的分段,以检索文本块和行。第二部分提供了一个交互式界面,允许用户自定义自动分析,可视化其结果并进行更正。我们的工具仍然是原型,但是,第一个实验给出了令人鼓舞的结果。因此,在本文中,我们展示了如何为中世纪手稿版式生成基本事实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号