首页> 外文会议>Applications of Artificial Neural Networks IV >Using back error propagation networks for automatic document image classification
【24h】

Using back error propagation networks for automatic document image classification

机译:使用反向误差传播网络进行自动文档图像分类

获取原文
获取原文并翻译 | 示例

摘要

Abstract: The Lister Hill National Center for Biomedical Communications is a Research and Development Division of the National Library of Medicine. One of the Center's current research projects involves the conversion of entire journals to bitmapped binary page images. In an effort to reduce operator errors that sometimes occur during document capture, three back error propagation networks were designed to automatically identify journal title based on features in the binary image of the journal's front cover page. For all three network designs, twenty five journal titles were randomly selected from the stored database of image files. Seven cover page images from each title were selected as the training set. For each title, three other cover page images were selected as the test set. Each bitmapped image was initially processed by counting the total number of black pixels in 32- pixel wide rows and columns of the page image. For the first network, these counts were scaled to create 122-element count vectors as the input vectors to a back error propagation network. The network had one output node for each journal classification. Although the network was successful in correctly classifying the 25 journals, the large input vector resulted in a large network and, consequently, a long training period. In an alternative approach, the first thirty-five coefficients of the Fast Fourier Transform of the count vector were used as the input vector to a second network. A third approach was to train a separate network for each journal using the original count vectors as input and with only one output node. The output of the network could be 'yes' (it is this journal) or 'no' (it is not this journal). This final design promises to be most efficient for a system in which journal titles are added or removed as it does not require retraining a large network for each change.!7
机译:摘要:李斯特山国家生物医学通讯中心是国家医学图书馆的研发部门。该中心当前的研究项目之一是将整个期刊转换为位图二进制页面图像。为了减少有时在文档捕获过程中发生的操作员错误,设计了三个反向错误传播网络,以根据期刊封面的二进制图像中的特征自动识别期刊标题。对于所有三个网络设计,从存储的图像文件数据库中随机选择了25个期刊标题。选择每个标题的七个封面图像作为训练集。对于每个标题,选择另外三个封面图像作为测试集。最初,通过对页面图像中32像素宽的行和列中的黑色像素总数进行计数来对每个位图图像进行处理。对于第一个网络,按比例缩放这些计数以创建122个元素的计数向量,作为反向误差传播网络的输入向量。网络的每个日记帐分类都有一个输出节点。尽管该网络成功地对25种期刊进行了正确分类,但是较大的输入向量导致了较大的网络,因此需要较长的培训时间。在替代方法中,将计数向量的快速傅立叶变换的前三十五个系数用作第二个网络的输入向量。第三种方法是使用原始计数向量作为输入并仅使用一个输出节点为每个日记训练一个单独的网络。网络的输出可以是“是”(这是该日记帐)或“否”(它不是该日记帐)。最终的设计对于在其中添加或删除日记标题的系统而言是最有效的,因为它不需要为每次更改重新培训大型网络。!7

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号