首页> 外文会议>Pacific Rim international conference on artificial intelligence >An End-to-End Preprocessor Based on Adversiarial Learning for Mongolian Historical Document OCR
【24h】

An End-to-End Preprocessor Based on Adversiarial Learning for Mongolian Historical Document OCR

机译:基于对抗学习的蒙古历史文献OCR端到端预处理器

获取原文

摘要

In Mongolian historical document recognition, preprocessing mainly involves image binarization and denoising. This is a challenging task and greatly effects the accuracy of the recognition result. Concerning the fact that image binarization and denoising are both image-to-image tasks, this paper proposes an end-to-end preprocessor for Mongolian historical document OCR. The preprocessor is trained in an adversarial learning fashion and deal with binarization and denoising simultaneously. The input of the preprocessor is the color image of Mongolian document images, and the output is the clean binary images which can be used for word recognition. The preprocessor was trained on a limited dataset and performed better than the combination of binarization and denoising methods used in earlier Mongolian historical document OCR systems.
机译:在蒙古历史文献识别中,预处理主要涉及图像二值化和去噪。这是一项艰巨的任务,并且极大地影响识别结果的准确性。针对图像二值化和去噪都是图像到图像的任务这一事实,本文提出了蒙古历史文档OCR的端到端预处理器。预处理器以对抗性学习方式进行训练,并同时处理二值化和去噪。预处理器的输入是蒙古文文档图像的彩色图像,输出是可用于单词识别的纯净二进制图像。该预处理器在有限的数据集上进行了训练,其性能优于早期蒙古历史文献OCR系统中使用的二值化和去噪方法相结合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号