首页> 外文会议>Iranian Conference on Signal Processing and Intelligent Systems >Document Image Classification using SqueezeNet Convolutional Neural Network
【24h】

Document Image Classification using SqueezeNet Convolutional Neural Network

机译:使用SqueezeNet卷积神经网络进行文档图像分类

获取原文

摘要

SqueezeNet networks perform well on image classification tasks, achieving accuracies comparable to state of the art convolutional neural networks. In this research we evaluate the performance of SqueezeNet networks in document image classification, showing that an ImageNet pretrained SqueezeNet achieves an accuracy of approximately 75 percent over 10 classes on the Tobacco-3482 dataset, which is comparable to other state of the art convolutional neural networks in terms of accuracy, while containing 50 times less weights compared to them. We then visualize saliency maps of the gradient of our networks output to input, which shows that the network is able to learn meaningful features that are useful for document classification. Features such as the existence of handwritten text, document titles, text alignment and tabular structures, which are proof that the network does not overfit to redundant representations of the dataset itself.
机译:SqueezeNet网络在图像分类任务上表现出色,可实现与最先进的卷积神经网络相当的准确性。在这项研究中,我们评估了SqueezeNet网络在文档图像分类中的性能,表明ImageNet预先训练的SqueezeNet在Tobacco-3482数据集的10个类别上的准确度达到了大约75%,这与其他先进的卷积神经网络相当就准确性而言,重量却比它们少50倍。然后,我们将网络输出到输入的梯度的显着图可视化,这表明网络能够学习对文档分类有用的有意义的功能。诸如手写文本,文档标题,文本对齐和表格结构之类的特征,证明了网络不会过度适应数据集本身的冗余表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号