首页> 外文OA文献 >Application of Recognition Input Squinting and Error-Correcting Output Coding to Convolutional Neural Networks
【2h】

Application of Recognition Input Squinting and Error-Correcting Output Coding to Convolutional Neural Networks

机译:识别输入斜视和纠错输出编码在卷积神经网络中的应用

摘要

The Convolutional Neural Network (CNN) is a type of artificial neural network that is successful in addressing many computer vision classification problems. This thesis considers problems related to optical character recognition by CNN when few training samples are available. Two techniques are proposed that can be used to improve the application of CNNs to such problems and these benefits are demonstrated experimentally on subsets of two labelled databases: MNIST (handwritten digits) and CENPARMI-MPC (machineprintedudcharacters).ududThe first technique is novel and is called “Recognition Input Squinting”. It involves taking the input image to be recognized and applying a set of geometric transformations on it to produce a set of squinted images. The trained CNN classifier then recognizes each of these generated input images and computes an overall recognition confidence score. It is shown that this technique yields superior recognition precision as compared to the caseudwhere a single input image is recognized without squinting.ududThe second technique is an application of the Error-Correcting Output Coding technique to the CNN. Each class to be recognized is assigned a codeword from an appropriately chosen error-correcting code’s codebook and the CNN is trained using these codeword labels. At recognition time, the output class is selected according to a minimum code distance criterion. It is shown that this technique provides better recognition precision than when the classic place output coding is used.
机译:卷积神经网络(CNN)是一种成功解决许多计算机视觉分类问题的人工神经网络。本文考虑了在训练样本很少的情况下与CNN进行光学字符识别有关的问题。提出了两种可用于改进CNN在此类问题上的应用的技术,并且在两个标记的数据库的子集上通过实验证明了这些好处:MNIST(手写数字)和CENPARMI-MPC(机印 udcharacters)。 ud ud该技术是新颖的,被称为“识别输入斜视”。它涉及获取要识别的输入图像,并在其上应用一组几何变换以生成一组斜眼图像。然后,训练有素的CNN分类器识别这些生成的输入图像中的每一个,并计算总体识别置信度得分。结果表明,与在不斜视的情况下识别单个输入图像的情况相比,该技术具有更高的识别精度。 ud ud第二种技术是将纠错输出编码技术应用于CNN。从适当选择的纠错代码的代码簿中为每个要识别的类别分配一个代码字,并使用这些代码字标签对CNN进行培训。在识别时,根据最小代码距离标准选择输出类别。结果表明,与使用经典位置输出编码时相比,该技术提供了更好的识别精度。

著录项

  • 作者

    Stathopoulos George;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号