【24h】

How Does a CNN Manage Different Printing Types?

机译:CNN如何管理不同的打印类型?

获取原文

摘要

In past OCR research, different OCR engines are used for different printing types, i.e., machine-printed characters, handwritten characters, and decorated fonts. A recent research, however, reveals that convolutional neural networks (CNN) can realize a universal OCR, which can deal with any printing types without pre-classification into individual types. In this paper, we analyze how CNN for universal OCR manage the different printing types. More specifically, we try to find where a handwritten character of a class and a machine-printed character of the same class are "fused" in CNN. For analysis, we use two different approaches. The first approach is statistical analysis for detecting the CNN units which are sensitive (or insensitive) to type difference. The second approach is network-based visualization of pattern distribution in each layer. Both analyses suggest the same trend that types are not fully fused in convolutional layers but the distributions of the same class from different types become closer in upper layers.
机译:在过去的OCR研究中,不同的OCR发动机用于不同的打印类型,即机器印刷的字符,手写字符和装饰字体。然而,最近的研究表明,卷积神经网络(CNN)可以实现通用的OCR,这可以处理任何打印类型,而无需预先分类为单独的类型。在本文中,我们分析了CNN如何为Universal OCR管理不同的打印类型。更具体地说,我们尝试找到类的手写字符和同一类的机器打印字符在CNN中“融合”。对于分析,我们使用两种不同的方法。第一种方法是检测敏感(或不敏感)的CNN单元的统计分析。第二种方法是每层模式分布的基于网络的可视化。两种分析都表明了与卷积层中没有完全融合的相同趋势,但是来自不同类型的同一类的分布在上层更近。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号