首页> 外文期刊>Pattern recognition letters >Text-independent writer identification using convolutional neural network
【24h】

Text-independent writer identification using convolutional neural network

机译:卷积神经网络的文本无关作者识别

获取原文
获取原文并翻译 | 示例
           

摘要

The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also. (c) 2018 Elsevier B.V. All rights reserved.
机译:文本无关的作者识别方法不需要作者编写一些预定的文本。先前与文本无关的作者识别的研究是基于识别专家设计的特定于作者的功能的。然而,在过去的十年中,深度学习方法已成功应用于自动从数据中学习特征。我们在这里提出一种不需要文本先验识别的,与文本无关的作者识别的端到端深度学习方法。首先对卷积神经网络(CNN)进行训练,以提取局部特征,这些局部特征代表了整个字符图像及其子区域中单个笔迹的特征。来自训练集的图像的随机采样元组用于训练CNN并聚合从元组中提取的图像局部特征以形成全局特征。对于每个训练时期,重复对元组进行随机采样的过程,这等效于准备了大量训练模式以训练CNN以进行与文本无关的作者标识。我们在JEITA-HP数据库中进行了离线手写日语字符模式的实验。我们的方法以200个字符为准,对100位作家进行分类的准确性达到99.97%。即使使用100个作者的50个字符或400个作者的100个字符,我们的方法也分别达到92.80%或93.82%的准确度。我们在Firemaker和IAM数据库中进行了离线手写英文文本的进一步实验。每个作者仅用一页进行培训,我们的方法就可以对900名作家进行分类,准确性超过91.81%。总体而言,与以前发布的基于手工功能和聚类算法的最佳结果相比,我们获得了更好的性能,这也证明了我们的方法对于手写英文文本的有效性。 (c)2018 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2019年第4期|104-112|共9页
  • 作者单位

    Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan;

    Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan;

    Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan;

    Jagiellonian Univ, Inst Philosophy, Krakow, Poland;

    Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号