【24h】

Proposition to distinguish machine-printed from handwritten Arabic and Latin words

机译:区分机器打印和手写阿拉伯语和拉丁语单词的主张

获取原文
获取原文并翻译 | 示例

摘要

In this work, we gathered some contributions to identify script and its nature. We successfully employed many features to distinguish between handwritten and machine-printed Arabic and Latin scripts at word level. Some of them are previously used in the literature, and the others are here proposed. The new proposed structural features are intrinsic to Arabic and Latin scripts. The performance of all extracted features is studied towards this paper. We also compared the performance of three classifiers: Bayes (AODEsr), k-Nearest Neighbor (k-NN) and Decision Tree (J48), used to identify the script at word level. These classifiers have been chosen enough different to test the feature contributions. We carried experiments using standard databases. Obtained results demonstrate used feature capability to capture differences between scripts. Using a set of 58 selected features and a Bayes-based classifier, we achieved an average identification rate equals to 98.72%, which considered a very satisfactory rate compared to some related works.
机译:在这项工作中,我们收集了一些内容来识别脚本及其性质。我们成功地利用了许多功能来区分单词级别的手写和机器打印的阿拉伯语和拉丁语脚本。其中一些以前在文献中使用过,其他在这里提出。拟议的新结构特征是阿拉伯文和拉丁文文字所固有的。本文针对所有提取特征的性能进行了研究。我们还比较了三个分类器的性能:贝叶斯(AODEsr),k最近邻(k-NN)和决策树(J48),用于在单词级别识别脚本。这些分类器已选择足够不同以测试特征贡献。我们使用标准数据库进行了实验。获得的结果证明了已使用的功能捕获了脚本之间的差异。使用一组58个选定特征和一个基于贝叶斯的分类器,我们实现了平均识别率等于98.72%,与某些相关作品相比,这是一个非常令人满意的比率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号