【24h】

Arabic CWR Based on Correlation of Normalized Signatures of Words Images

机译:基于词图像归一化特征相关性的阿拉伯文CWR

获取原文

摘要

The traditional methods for Arabic OCR (AOCR) based on segmentation of each word into a set of characters. The Arabic language is of cursive nature, and the character's shape depends on its position in the word. There are about 100 shape of the characters have to be classified, and some of them may be overlapped. Our approach use a normalized signature of the time signal of the pulse coupled neural network PCNN, supported with some shape primitives to represent the number of the word complementary and their positions within the image of the word. A lookup dictionary of words with its signatures was constructed, and structured in groups using a decision tree. The tested signature was routed through the tree to the nearest group, and then the signature and its related word with higher correlation within the selected group will be the classified. This method overcome many difficulties arise in cursive word recognition CWR for printed script with different font type and size; also it shows higher accuracy for the classification process, 96%.
机译:阿拉伯语OCR(AOCR)的传统方法是将每个单词分割为一组字符。阿拉伯语具有草书性质,字符的形状取决于其在单词中的位置。必须对大约100个形状的字符进行分类,其中一些字符可能会重叠。我们的方法使用脉冲耦合神经网络PCNN的时间信号的归一化签名,并通过一些形状原语支持,以表示单词互补词的数量及其在单词图像中的位置。构造了一个带有签名的单词查找字典,并使用决策树将其分组构造。将经过测试的签名通过树路由到最近的组,然后将对签名及其在所选组中具有更高相关性的相关单词进行分类。该方法克服了不同字体类型和大小的草稿的草书识别CWR中遇到的许多困难。在分类过程中也显示出更高的准确性,达到96%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号