首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Convolutional Neural Networks for Font Classification
【24h】

Convolutional Neural Networks for Font Classification

机译:用于字体分类的卷积神经网络

获取原文

摘要

Classifying pages or text lines into font categories aids transcription because single font Optical Character Recognition (OCR) is generally more accurate than omni-font OCR. We present a simple framework based on Convolutional Neural Networks (CNNs), where a CNN is trained to classify small patches of text into predefined font classes. To classify page or line images, we average the CNN predictions over densely extracted patches. We show that this method achieves state-of-the-art performance on a challenging dataset of 40 Arabic computer fonts with 98.8% line level accuracy. This same method also achieves the highest reported accuracy of 86.6% in predicting paleographic scribal script classes at the page level on medieval Latin manuscripts. Finally, we analyze what features are learned by the CNN on Latin manuscripts and find evidence that the CNN is learning both the defining morphological differences between scribal script classes as well as overfitting to class-correlated nuisance factors. We propose a novel form of data augmentation that improves robustness to text darkness, further increasing classification performance.
机译:将页面或文本行分类为字体类别辅助转录,因为单个字体光学字符识别(OCR)通常比OMNI-Font OCR更精确。我们介绍了一个基于卷积神经网络(CNNS)的简单框架,其中CNN培训,以将小文本的小块分类为预定义的字体类。为了对页面或行映像进行分类,我们将平均在密集提取的斑块上的CNN预测。我们表明,该方法在40个阿拉伯语计算机字体的具有挑战性的数据集中实现了最先进的性能,线路电平精度为98.8%。这种方法还实现了在中世纪拉丁文稿中的页面级别预测古血统剧本课程的最高报告准确性86.6%。最后,我们分析了拉丁文稿中的CNN学习了哪些功能,并找到了CNN在学习血交脚本类别之间的定义形态学差异以及对类相关的滋扰因子的过度来看。我们提出了一种新颖的数据增强形式,可以提高对文本黑暗的鲁棒性,进一步提高分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号