Journal: International journal of reasoning-based intelligent systems

Improving post-processing optical character recognition documents with Arabic language using spelling error detection and correction

Abstract

Optical character recognition (OCR) is used to convert scanned documents into text. The resulting text needs to be validated for correctness. The problem is more severe for Arabic text because of the complexity of the Arabic language. This research explores ways of improving the effectiveness of OCR spell checking by proposing a post-processing Arabic OCR system based on three different approaches: Microsoft Office Word with the Google online suggestion system, the Ayaspell spell checker with the Google online suggestion system, and the Google online suggestion system alone. We used precision and recall to evaluate the effectiveness of the proposed OCR post-processing. The results show that using Microsoft Office Word with Google outperforms the other approaches, with an accuracy of 0.49.
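
As an illustration of the evaluation measures named in the abstract, the following is a minimal Python sketch of how precision and recall could be computed for spelling error detection. The abstract does not say how the authors counted true and false positives, so the set-based definition, the function name precision_recall, and the sample token positions are all assumptions made for illustration only.

```python
def precision_recall(flagged, actual_errors):
    """Return (precision, recall) for error detection.

    Both arguments are collections of token positions. This set-based
    definition is an assumption, not the paper's stated method.
    """
    flagged, actual_errors = set(flagged), set(actual_errors)
    true_positives = len(flagged & actual_errors)
    # Precision: share of flagged tokens that are real errors.
    precision = true_positives / len(flagged) if flagged else 0.0
    # Recall: share of real errors that were flagged.
    recall = true_positives / len(actual_errors) if actual_errors else 0.0
    return precision, recall


# Hypothetical data: token positions in one OCR output.
actual_errors = {2, 5, 7, 9}  # positions that truly contain OCR errors
flagged = {2, 5, 8}           # positions a spell checker flagged
p, r = precision_recall(flagged, actual_errors)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this made-up case two of the three flagged tokens are real errors and two of the four real errors were found, which is the kind of trade-off the two measures are meant to expose.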