A New Strategy of OCR Combination

Abstract

In this paper, we present a new method for combining OCR systems in order to optimize document recognition. The proposed method combines the results of individual OCR systems, which are classified into several categories according to their characteristics. A set of OCR systems is evaluated using several performance measures. The contribution of this paper is twofold. First, we assess OCR systems according to two new criteria: their performance in table recognition and their capacity to recognize graphs. Second, we define a new strategy for OCR combination. Our hybrid strategy is based on three techniques: image analysis, the parallel combination approach, and the majority vote method. None of the related combination methods takes the analysis of the document image into account. With this new method, we aim to increase the recognition rate of documents regardless of their structure (simple or complex) and their composition (text, graphs, tables, images, etc.).
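
The abstract names majority voting as one of the three techniques but does not describe how the vote is carried out. As a rough illustration only, the sketch below votes token by token over the outputs of several OCR engines run in parallel. The function name `majority_vote`, the assumption that all outputs are already aligned to the same number of tokens, and the tie-breaking rule (the engine listed first wins) are choices made for this example and are not taken from the paper.

```python
from collections import Counter


def majority_vote(outputs):
    """Combine token sequences from several OCR engines by per-position vote.

    Hypothetical helper: assumes every sequence is already aligned and has
    the same length. Ties go to the engine listed first.
    """
    if not outputs:
        return []
    length = len(outputs[0])
    if any(len(seq) != length for seq in outputs):
        raise ValueError("sequences must be aligned to the same length")

    combined = []
    for candidates in zip(*outputs):
        counts = Counter(candidates)
        best = max(counts.values())
        # The first candidate (in engine order) with the top count wins.
        combined.append(next(tok for tok in candidates if counts[tok] == best))
    return combined


# Made-up outputs from three engines reading the same line of text.
engine_a = "The quick brovvn fox".split()
engine_b = "The quick brown fox".split()
engine_c = "Tne quick brown fox".split()

result = majority_vote([engine_a, engine_b, engine_c])
print(" ".join(result))
```

Real OCR outputs rarely have the same length, so a practical system would need an alignment step before voting. That step is left out here to keep the sketch short.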
