首页> 外文会议>Applications of Artificial Neural Networks >Using back error propagation networks for automatic document image classification
【24h】

Using back error propagation networks for automatic document image classification

机译:使用返回错误传播网络进行自动文档图像分类

获取原文

摘要

The Lister Hill National Center for Biomedical Communications is a Research and Development Division of the National Library of Medicine. One of the Center's current research projects involves the conversion of entire journals to bitmapped binary page images. In an effort to reduce operator errors that sometimes occur during document capture, three back error propagation networks were designed to automatically identify journal title based on features in the binary image of the journal's front cover page. For all three network designs, twenty five journal titles were randomly selected from the stored database of image files. Seven cover page images from each title were selected as the training set. For each title, three other cover page images were selected as the test set. Each bitmapped image was initially processed by counting the total number of black pixels in 32-pixel wide rows and columns of the page image. For the first network, these counts were scaled to create 122-element count vectors as the input vectors to a back error propagation network. The network had one output node for each journal classification. Although the network was successful in correctly classifying the 25 journals, the large input vector resulted in a large network and, consequently, a long training period. In an alternative approach, the first thirty-five coefficients of the Fast Fourier Transform of the count vector were used as the input vector to a second network. A third approach was to train a separate network for each journal using the original count vectors as input and with only one output node. The output of the network could be "yes" (it is this journal) or "no" (it is not this journal). This final design promises to be most efficient for a system in which journal titles are added or removed as it does not require retraining a large network for each change.
机译:Lister Hill国家生物医学通信中心是国家医学图书馆的研发部门。该中心目前的研究项目之一涉及将整个期刊转换为位映射的二进制页面图像。为了减少在文档捕获期间有时发生的操作员错误,设计了三个背部错误传播网络,旨在根据日志前封面页面的二进制图像中的功能自动识别日志标题。对于所有三种网络设计,从存储的图像文件数据库中随机选择二十五期期刊标题。每个标题的七页封面页面被选为培训集。对于每个标题,选择了三个其他封面图像作为测试集。最初通过在页面图像的32像素宽行和列中计数黑色像素的总数来处理每个位映射的图像。对于第一个网络,缩放这些计数以创建122元元计数向量作为对误差传播网络的输入向量。网络有一个输出节点,用于每个日记分类。虽然网络在正确分类25个期刊方面取得成功,但大输入载体导致大型网络,因此,长期训练期。在替代方法中,计数矢量的快速傅里叶变换的第一三十五系数被用作第二网络的输入向量。第三种方法是使用原始计数向量作为输入,只有一个输出节点训练每个日记的单独网络。网络的输出可能是“是”(这是本期刊)或“否”(这不是本期刊)。这种最终设计有望最有效地对添加或删除杂志的系统,因为它不需要为每个变化重新培训大型网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号