首页> 外文会议>International Conference on Design Science Research in Information Systems and Technology >A Novel Hybrid Optical Character Recognition Approach for Digitizing Text in Forms
【24h】

A Novel Hybrid Optical Character Recognition Approach for Digitizing Text in Forms

机译:文本数字化的新型混合光学字符识别方法

获取原文

摘要

The huge amount of document-based processes has considerably contributed to the need of automated systems which are able to appropriately digitize text in documents concerning forms. For example, the text in scanned administrative forms is not accessible without an adequate conversion from pixels to editable text. Against this background, many organizations tap the potential of Optical Character Recognition (OCR) as it is capable of supporting the digitization of text in documents. However, there is still a lack of integrated OCR approaches, considering both handwritten and machine printed texts, which are both of major importance in the context of digitizing text in forms. To address this problem, we propose a new hybrid OCR approach recognizing handwritten and machine printed text based on neural networks in an integrated perspective. We demonstrate the practical applicability of our approach using publicly available forms on which the approach could be successfully applied. Finally, we evaluate our novel hybrid approach in comparison to existing state-of-the-art approaches.
机译:大量基于文档的过程极大地促进了对自动化系统的需求,这些系统能够适当地数字化有关表单的文档中的文本。例如,没有从像素到可编辑文本的适当转换,就无法访​​问扫描的管理表单中的文本。在这种背景下,许多组织利用光学字符识别(OCR)的潜力,因为它能够支持文档中文本的数字化。但是,考虑到手写文本和机器打印文本,仍然缺乏集成的OCR方法,这在将文本形式的数字化的背景下都非常重要。为了解决这个问题,我们提出了一种新的混合式OCR方法,该方法以集成的角度基于神经网络识别手写和机器打印的文本。我们使用可以成功应用该方法的公开形式来证明我们的方法的实际适用性。最后,我们将与现有的最新方法进行比较来评估我们的新型混合方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号