Segmentation-free word recognition with application to Arabic


Abstract

This paper describes the design and implementation of a system that recognizes machine-printed Arabic words without prior segmentation. The technique describes symbols in terms of shape primitives. At recognition time, the primitives are detected on a word image using mathematical morphology operations. The system then matches the detected primitives against symbol models, which yields a spatial arrangement of matched symbol models. It searches the space of such arrangements and outputs the one with the highest posterior probability as the recognition result for the word. The advantage of this whole-word approach over a segmentation-based approach is that the recognition result is optimized with regard to the whole word. Preliminary experiments using a lexicon of 42,000 words show a recognition rate of 99.4% for noise-free text and 73% for scanned text.
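
The abstract gives no implementation details, so the following is only a minimal sketch of the two ideas it names: detecting a shape primitive with a morphological operation, and choosing the candidate with the highest posterior. Everything here is an assumption made for illustration: binary erosion stands in for the unspecified morphology operations, the stroke-shaped structuring elements and the toy image are invented, and `erode` and `best_arrangement` are hypothetical helper names, not the authors' code.

```python
def erode(image, element):
    """Binary erosion on nested 0/1 lists (assumed stand-in for the paper's
    unspecified morphology operations).

    Returns the (row, col) positions where `element`, anchored at its
    top-left corner, fits entirely inside the foreground of `image`.
    """
    rows, cols = len(image), len(image[0])
    e_rows, e_cols = len(element), len(element[0])
    hits = []
    for r in range(rows - e_rows + 1):
        for c in range(cols - e_cols + 1):
            # Every foreground pixel of the element must land on foreground.
            if all(image[r + i][c + j]
                   for i in range(e_rows)
                   for j in range(e_cols)
                   if element[i][j]):
                hits.append((r, c))
    return hits


def best_arrangement(candidates):
    """Pick the candidate with the highest posterior.

    `candidates` is a list of (label, log_likelihood, log_prior) tuples.
    The posterior is proportional to likelihood times prior, so the sum of
    the two log terms ranks the candidates. The scores are hypothetical.
    """
    return max(candidates, key=lambda cand: cand[1] + cand[2])[0]


# Toy word image (invented): an L-shaped stroke and a separate vertical stroke.
image = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]

# Hypothetical primitives: a 3-pixel vertical and a 3-pixel horizontal stroke.
vertical_stroke = [[1], [1], [1]]
horizontal_stroke = [[1, 1, 1]]

vertical_hits = erode(image, vertical_stroke)
horizontal_hits = erode(image, horizontal_stroke)
```

In a full system the detected primitive positions would feed the matching against symbol models, and `best_arrangement` would be applied to the candidate spatial arrangements found by the search; neither of those steps is specified in the abstract, so they are left out here.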
