首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A model-based approach to offline text-independent Arabic writer identification and verification
【24h】

A model-based approach to offline text-independent Arabic writer identification and verification

机译:基于模型的离线文本无关阿拉伯作家识别和验证方法

获取原文
获取原文并翻译 | 示例
           

摘要

Rapid movement generation models are described in the literature as an efficient tool to apprehend the handwriting behavior. Fields of application are diverse, including handwriting description, regeneration, and more recently OCR. In this paper, we propose a grapheme-based approach to offline Arabic writer identification and verification. Rather than extracting naturel graphemes from a training corpus using segmentation and clustering, it synthesizes its own graphemes based on the beta-elliptic model. Originality lies in the independence of the grapheme codebook from any training process, and the use of a model instead. One full and four partial codebooks are generated and tested. Using feature selection, raw codebooks are reduced in size with respect to FDR, FDR and cross-correlation, and random subsampling criteria. A total of 60 feature vectors are extracted using template matching, and evaluated with 411 individual writers from the IFN/ENIT database. The results presented in this study demonstrated the wide representativity and the good generalization capability of synthetic codebooks. We obtained a topl rate=90.02% and a top5 rate= 96.35% for writer identification, and an EER= 2.1% for writer verification. Our approach showed better properties than most of the surveyed techniques in terms of supported corpus size and identification rates. To the best of our knowledge, this study is among the first to exploit the concept of model-based synthetic codebooks in writer identification and verification. (C) 2014 Elsevier Ltd. All rights reserved.
机译:快速运动生成模型在文献中被描述为了解手写行为的有效工具。应用领域是多种多样的,包括手写说明,重新生成以及最近的OCR。在本文中,我们提出了一种基于字素的离线阿拉伯作家识别和验证方法。它不是使用分割和聚类从训练语料库中提取自然字素,而是根据β-椭圆模型合成自己的字素。独创性在于字素码本与任何训练过程的独立性,以及模型的使用。生成并测试了一本完整的密码本和四本部分的密码本。使用特征选择,相对于FDR,FDR和互相关以及随机子采样标准,可以减少原始码本的大小。使用模板匹配提取总共60个特征向量,并使用411个独立作者从IFN / ENIT数据库中进行评估。这项研究提出的结果证明了合成码本的广泛代表性和良好的泛化能力。对于作者识别,我们获得了最高纪录率为90.02%,top5比率为96.35%,对于作者进行了验证,EER = 2.1%。在支持的语料库大小和识别率方面,我们的方法显示出比大多数调查技术更好的属性。据我们所知,这项研究是最早在作者识别和验证中利用基于模型的合成密码本概念的研究之一。 (C)2014 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号