Convolutional Neural Networks for Font Classification

机译：用于字体分类的卷积神经网络

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Classifying pages or text lines into font categories aids transcription because single font Optical Character Recognition (OCR) is generally more accurate than omni-font OCR. We present a simple framework based on Convolutional Neural Networks (CNNs), where a CNN is trained to classify small patches of text into predefined font classes. To classify page or line images, we average the CNN predictions over densely extracted patches. We show that this method achieves state-of-the-art performance on a challenging dataset of 40 Arabic computer fonts with 98.8% line level accuracy. This same method also achieves the highest reported accuracy of 86.6% in predicting paleographic scribal script classes at the page level on medieval Latin manuscripts. Finally, we analyze what features are learned by the CNN on Latin manuscripts and find evidence that the CNN is learning both the defining morphological differences between scribal script classes as well as overfitting to class-correlated nuisance factors. We propose a novel form of data augmentation that improves robustness to text darkness, further increasing classification performance.

机译：将页面或文本行分类为字体类别辅助转录，因为单个字体光学字符识别（OCR）通常比OMNI-Font OCR更精确。我们介绍了一个基于卷积神经网络（CNNS）的简单框架，其中CNN培训，以将小文本的小块分类为预定义的字体类。为了对页面或行映像进行分类，我们将平均在密集提取的斑块上的CNN预测。我们表明，该方法在40个阿拉伯语计算机字体的具有挑战性的数据集中实现了最先进的性能，线路电平精度为98.8％。这种方法还实现了在中世纪拉丁文稿中的页面级别预测古血统剧本课程的最高报告准确性86.6％。最后，我们分析了拉丁文稿中的CNN学习了哪些功能，并找到了CNN在学习血交脚本类别之间的定义形态学差异以及对类相关的滋扰因子的过度来看。我们提出了一种新颖的数据增强形式，可以提高对文本黑暗的鲁棒性，进一步提高分类性能。

著录项

来源
《IAPR International Conference on Document Analysis and Recognition》|2017年|733-1472p|共6页
会议地点
作者
Chris Tensmeyer; Daniel Saunders; Tony Martinez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
Document Image Classification; Convolutional Neural Networks; Deep Learning; Preprocessing; Data Augmentation; Network Architecture;

机译：文档图像分类;卷积神经网络;深度学习;预处理;数据增强;网络架构;

相似文献

外文文献
中文文献
专利

1. Application of Deep Convolutional Neural Networks in Attention-Deficit/Hyperactivity Disorder Classification: Data Augmentation and Convolutional Neural Network Transfer Learning [J] . Zhu Li, Chang Weike Journal of Medical Imaging and Health Informatics . 2019,第8期

机译：深度卷积神经网络在注意力缺陷/多动障碍分类中的应用：数据增强与卷积神经网络转移学习
2. Fractal Neural Network: A new ensemble of fractal geometry and convolutional neural networks for the classification of histology images [J] . Roberto Guilherme Freire, Lumini Alessandra, Neves Leandro Alves, Expert systems with applications . 2021,第Mara期

机译：分形神经网络：组织学图像分类的分形几何和卷积神经网络的新集合
3. Improved Breast Cancer Classification Through Combining Graph Convolutional Network and Convolutional Neural Network [J] . Yu-Dong Zhang, Suresh Chandra Satapathy, David S. Guttery, Information Processing & Management . 2021,第2期

机译：通过结合图卷积网络和卷积神经网络改善乳腺癌分类
4. Convolutional Neural Networks for Font Classification [C] . Chris Tensmeyer, Daniel Saunders, Tony Martinez IAPR International Conference on Document Analysis and Recognition . 2017

机译：卷积神经网络的字体分类
5. Combining Convolutional Neural Networks and Graph Neural Networks for Image Classification [D] . Trivedy, Vivek. 2021

机译：结合卷积神经网络和图形神经网络的图像分类
6. 3D Convolutional Neural Networks Initialized from Pretrained 2D Convolutional Neural Networks for Classification of Industrial Parts [O] . Ibon Merino, Jon Azpiazu, Anthony Remazeilles, 2021

机译：3D卷积神经网络从佩带的2D卷积神经网络初始化用于工业部件的分类
7. Convolutional Neural Networks for Font Classification [O] . Tensmeyer, Chris, Saunders, Daniel, Martinez, Tony 2017

机译：用于字体分类的卷积神经网络

Convolutional Neural Networks for Font Classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅