首页> 外文期刊>Neural computing & applications >Online continuous multi-stroke Persian/Arabic character recognition by novel spatio-temporal features for digitizer pen devices
【24h】

Online continuous multi-stroke Persian/Arabic character recognition by novel spatio-temporal features for digitizer pen devices

机译:在线连续多冲程波斯/阿拉伯语字符识别数字化钢笔设备的新型时空功能

获取原文
获取原文并翻译 | 示例
       

摘要

Nowadays, digitizer pens have become front end of many digital devices. The increasing use of this technology has necessitated the need for producing pen-based virtual keyboard systems. Despite attempts to create such systems in English, their absence for Persian/Arabic languages is an obvious defect. The goal of this paper is presenting an online continuous Persian/Arabic character recognition method. A character in Persian/Arabic language is made of two types of signs or strokes: main body and delayed strokes (which may be zero or more sign). In this paper, a set of novel and discriminative spatial features are defined for these strokes. These features then are used in a novel algorithm to create a genetic programming-based decision tree called GPDT. The GPDT and spatio-temporal features are utilized by non-deterministic finite automata (NDFA) to recognize group-related strokes and related characters. The reason for using spatio-temporal features is the sameness of the main body of some Persian/Arabic letters (e.g., "& x62d;& x60c; & x62e;& x60c; & x62c;& x60c; & x686;"). There are also two other issues related to recognizing Persian/Arabic letters: unknown number of delayed stroke segments and the sameness of delayed strokes placement, which are removed by using an NDFA. In fact, after identifying group of main body with the help of GPDT, each recognized stroke makes a move in NDFA to stop in a character state (final state on the end of a path in NDFA). The proposed algorithm recognizes continuous Persian/Arabic letters and digits with a 92.43% accuracy and isolated letters and digits with 97.52% accuracy.
机译:如今,数字磁带笔已成为许多数字设备的前端。越来越多的使用这种技术需要需要生产基于笔的虚拟键盘系统。尽管尝试用英语创建此类系统,但他们对波斯/阿拉伯语语言的缺席是一个明显的缺陷。本文的目标是呈现在线连续波斯/阿拉伯字符识别方法。波斯/阿拉伯语语言的一个角色由两种类型的迹象或中风组成:主体和延迟笔划(可能是零或更多标志)。在本文中,为这些笔划定义了一组新颖的和鉴别的空间特征。然后,这些功能用于新颖的算法,以创建名为GPDT的基于基于遗传编程的决策树。通过非确定性有限自动机(NDFA)利用GPDT和时空特征来识别与群体相关的笔画和相关字符。使用时空特征的原因是某些波斯/阿拉伯语字母的主体的典型(例如,“;&x62e;&x62c;&x62c;&x60c;&x686;&x686;”)。还有另外两个问题与识别波斯/阿拉伯语字母有关:未知数量的延迟行程段和延迟笔划放置的识别,通过使用NDFA除去。事实上,在借助GPDT的帮助识别主体组后,每个识别的笔划都在NDFA中移动到一个字符状态(最终状态在NDFA中的路径结束时)。该算法识别连续的波斯/阿拉伯字母和数字,精度为92.43%和隔离字母和数字精度为97.52%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号