首页> 外文学位 >Freeform cursive handwriting recognition using a clustered neural network.
【24h】

Freeform cursive handwriting recognition using a clustered neural network.

机译:使用聚类神经网络的自由形式草书手写识别。

获取原文
获取原文并翻译 | 示例

摘要

Optical character recognition (OCR) software has advanced greatly in recent years. Machine-printed text can be scanned and converted to searchable text with word accuracy rates around 98%. Reasonably neat hand-printed text can be recognized with about 85% word accuracy. However, cursive handwriting still remains a challenge, with state-of-the-art performance still around 75%. Algorithms based on hidden Markov models have been only moderately successful, while recurrent neural networks have delivered the best results to date.;This thesis explored the feasibility of using a special type of feedforward neural network to convert freeform cursive handwriting to searchable text. The hidden nodes in this network were grouped into clusters, with each cluster being trained to recognize a unique character bigram. The network was trained on writing samples that were pre-segmented and annotated. Post-processing was facilitated in part by using the network to identify overlapping bigrams that were then linked together to form words and sentences. With dictionary assisted postprocessing, the network achieved word accuracy of 66.5% on a small, proprietary corpus.;The contributions in this thesis are threefold: 1) the novel clustered architecture of the feed-forward neural network, 2) the development of an expanded set of observers combining image masks, modifiers, and feature characterizations, and 3) the use of overlapping bigrams as the textual working unit to assist in context analysis and reconstruction.
机译:近年来,光学字符识别(OCR)软件取得了很大的进步。可以扫描机器打印的文本,并将其转换为可搜索的文本,字的准确率约为98%。可以以大约85%的单词准确度识别出合理整齐的手写文本。但是,草书手写仍然是一个挑战,其最新性能仍然约为75%。迄今为止,基于隐马尔可夫模型的算法仅取得了一定程度的成功,而递归神经网络迄今仍取得了最佳效果。本论文探讨了使用一种特殊类型的前馈神经网络将自由形式草书手写体转换为可搜索文本的可行性。该网络中的隐藏节点被分组为群集,每个群集都经过训练以识别唯一字符bigram。对网络进行了有关编写预先分段和注释的样本的培训。通过使用网络识别重叠的二元组,然后通过链接在一起形成单词和句子,部分地促进了后处理。借助字典辅助后处理,该网络在一个小的专有语料库上的单词准确度达到66.5%。该论文的贡献有三点:1)前馈神经网络的新型集群架构,2)扩展的扩展一组将图像蒙版,修饰符和特征表征相结合的观察者,以及3)使用重叠的二元组作为文本工作单元来辅助上下文分析和重建。

著录项

  • 作者

    Bristow, Kelly H.;

  • 作者单位

    University of North Texas.;

  • 授予单位 University of North Texas.;
  • 学科 Computer science.
  • 学位 M.S.
  • 年度 2015
  • 页码 52 p.
  • 总页数 52
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:52:20

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号