首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Identifying Machine-Printed and Handwritten Texts Using DropRegion and Deep Convolutional Network
【24h】

Identifying Machine-Printed and Handwritten Texts Using DropRegion and Deep Convolutional Network

机译:使用DropRegion和深度卷积网络识别机器打印和手写文本

获取原文

摘要

In this paper, we propose a deep convolutional neural network to identify machine-printed and handwritten texts. We also propose a novel data augmentation technique called DropRegion to make up for the lack of available data and enhance the generalization of the model. DropRegion increases data diversity by randomly dropping one of the stroke-containing regions in each raw input text-line image. Two parameters are introduced to make DropRegion adjustable for different data. For distinguishing texts of mixture of five languages including English, Chinese, Japanese, Korean and Russian, we have successfully achieved a very promising accuracy of 99.07% after DropRegion is applied, which is a significantly better performance compared to traditional method (97.91%) and our deep convolutional network baseline (98.75%).
机译:在本文中,我们提出了一种深度卷积神经网络来识别机器打印和手写文本。我们还提出了一种称为DropRegion的新型数据增强技术,以弥补可用数据的不足并增强模型的通用性。 DropRegion通过在每个原始输入文本行图像中随机删除其中一个包含笔画的区域来增加数据多样性。引入了两个参数,以使DropRegion可针对不同的数据进行调整。为了区分包括英语,中文,日语,韩语和俄语在内的五种语言的混合文本,在应用DropRegion后,我们已经成功实现了非常有希望的精度,为99.07 \%,与传统方法相比,其性能要好得多(97.91 \% )和我们的深度卷积网络基线(98.75 \%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号