首页> 外文会议>International Conference on Document Analysis and Recognition >Japanese Character Segmentation for Historical Handwritten Official Documents Using Fully Convolutional Networks
【24h】

Japanese Character Segmentation for Historical Handwritten Official Documents Using Fully Convolutional Networks

机译:使用完全卷积网络对历史手写正式文档的日语字符分割

获取原文

摘要

This paper proposes a character segmentation method using a fully convolutional network (FCN) and a post-processing phase. The network is trained with five-channel images that indicate five kinds of zones within the bounding box for each character-the top half, bottom half, left half, right half, and center. The post-processing step reconstructs the bounding boxes for characters from the five-channel image of the FCN output. The proposed method possesses the following advantages: (1) It is possible to process input images including multiple text lines directly; in other words, a text line segmentation process is unnecessary. (2) It does not rely upon character recognition. (3) It is robust to variations in the sizes of characters and the gaps between characters and also to cursive characters or character overlap. In the experiment of character segmentation, the accuracy ratio was 95% for real images of historical handwritten official documents written in Japanese.
机译:本文提出了一种使用全卷积网络(FCN)和后处理阶段的字符分割方法。用五通道图像训练网络,该图像指示每个字符在边界框内的五种区域-上半部分,下半部分,左半部分,右半部分和中心。后处理步骤从FCN输出的五通道图像重建字符的边框。所提出的方法具有以下优点:(1)可以直接处理包括多行文本的输入图像;换句话说,文本行分割过程是不必要的。 (2)它不依赖于字符识别。 (3)它对于字符尺寸的变化和字符之间的间隙以及草书字符或字符重叠具有鲁棒性。在字符分割实验中,用日语书写的历史手写正式文件的真实图像的准确率是95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号