首页> 外文会议>International Conference on Progress in Informatics and Computing >An offline text-independent writer identification system with SAE feature extraction
【24h】

An offline text-independent writer identification system with SAE feature extraction

机译:具有SAE特征提取的离线文本独立作者识别系统

获取原文

摘要

This paper presents an offline text-independent writer identification system in a multi-language environment using sparse auto-encoder (SAE) based codebook (SAEC) and non-text-segmentation feature extraction methods. The proposed codebook is designed by SAE structure and clustering. The superior of the designed frame work is that features can be effectively extracted and without the pre-step of text segmentation. The novel codebook can deal with mixed-languages at the same time. Hand-written texts in Chinese and English are considered in this study. We use both HIT Chinese database and IAM offline English database. The classification rate achieves the challenging accuracy of 95.56% concerning top 1 and 99.17% concerning top 10 on the two mixed databases. Another interesting aspect of our study is the evaluation of the factors such as patch sizes, patch numbers and the amount of text that influence the identification results.
机译:本文介绍了使用基于稀疏自动编码器(SAE)的码本(SAEC)和非文本分段特征提取方法的多语言环境中的离线文本的编写器识别系统。所提出的码本采用SAE结构和聚类设计。设计的框架工作的优越性是可以有效地提取特征,并且没有文本分割的预先分割。这本新颖的码本可以同时处理混合语言。在本研究中考虑了中文和英语中的手写文本。我们使用Mist中文数据库和IAM离线英语数据库。分类率在两个混合数据库上实现了95.56 %的挑战性准确性为95.56 %。我们研究的另一个有趣方面是评估补丁尺寸,补丁号和影响识别结果的文本数量等因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号