首页> 外文会议>Document recognition and retrieval XIX >A Robust Omnifont Open-Vocabulary Arabic OCR System Using Pseudo-2D-HMM
【24h】

A Robust Omnifont Open-Vocabulary Arabic OCR System Using Pseudo-2D-HMM

机译:使用伪2D-HMM的健壮的Omnifont开放词汇阿拉伯语OCR系统

获取原文
获取原文并翻译 | 示例

摘要

Recognizing old documents is highly desirable since the demand for quickly searching millions of archived documents has recently increased. Using Hidden Markov Models (HMMs) has been proven to be a good solution to tackle the main problems of recognizing typewritten Arabic characters. These attempts however achieved a remarkable success for omnifont OCR under very favorable conditions, they didn't achieve the same performance in practical conditions, i.e. noisy documents. In this paper we present an omnifont, large-vocabulary Arabic OCR system using Pseudo Two Dimensional Hidden Markov Model (P2DHMM), which is a generalization of the HMM. P2DHMM offers a more efficient way to model the Arabic characters, such model offer both minimal dependency on the font size/style (omnifont), and high level of robustness against noise. The evaluation results of this system are very promising compared to a baseline HMM system and best OCRs available in the market (Sakhr and NovoDynamics). The recognition accuracy of the P2DHMM classifier is measured against the classic HMM classifier, the average word accuracy rates for P2DHMM and HMM classifiers are 79% and 66% respectively. The overall system accuracy is measured against Sakhr and NovoDynamics OCR systems, the average word accuracy rates for P2DHMM, NovoDynamics, and Sakhr are 74%, 71%, and 61% respectively.
机译:识别旧文档是非常需要的,因为最近对快速搜索数百万个存档文档的需求增加了。使用隐马尔可夫模型(HMM)已被证明是解决识别打字阿拉伯字符的主要问题的好方法。然而,这些尝试在非常有利的条件下为全能OCR取得了令人瞩目的成功,但在实际条件下(即嘈杂的文件)却没有达到相同的性能。在本文中,我们使用伪二维隐马尔可夫模型(P2DHMM)提出了一种全字体,大词汇量的阿拉伯语OCR系统,该系统是HMM的推广。 P2DHMM提供了一种更有效的阿拉伯字符建模方法,这种模型既对字体大小/样式(全字体)具有最小的依赖性,又对噪声具有很高的鲁棒性。与基线HMM系统和市场上可用的最佳OCR(Sakhr和NovoDynamics)相比,该系统的评估结果非常有前途。 P2DHMM分类器的识别准确度是根据经典HMM分类器测得的,P2DHMM和HMM分类器的平均单词准确率分别为79%和66%。总体系统准确度是根据Sakhr和NovoDynamics OCR系统测得的,P2DHMM,NovoDynamics和Sakhr的平均字准确率分别为74%,71%和61%。

著录项

  • 来源
    《Document recognition and retrieval XIX》|2012年|p.829707.1-829707.8|共8页
  • 会议地点 Burlingame CA(US)
  • 作者单位

    Faculty of Engineering, Cairo University, Giza, EGYPT,The Engineering Company for the Development of Computer Systems RDI, EGYPT;

    Faculty of Engineering, Cairo University, Giza, EGYPT,The Engineering Company for the Development of Computer Systems RDI, EGYPT;

    Faculty of Computers and Information, Cairo University, Giza, EGYPT,The Engineering Company for the Development of Computer Systems RDI, EGYPT;

    Faculty of Computers and Information, Cairo University, Giza, EGYPT,The Engineering Company for the Development of Computer Systems RDI, EGYPT;

    Faculty of Engineering, Cairo University, Giza, EGYPT;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息处理(信息加工);
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号