Segmentation of Touching English Letters.

Abstract

This paper examines the problem of building a machine to read uncontrolled type fonts set with essentially no space between letters (within words). The consequence of this type of data, which represents the usual format of printed text, is that the data vectors produced by the optical scanner contain multiple letters and/or fragments of letters that cannot be easily separated. An algorithm based on a variant of running cross-correlation between prototype letters and successively 'windowed' fragments of the sentence is employed. The algorithm computes the Euclidean distance between prototypes and the sentence fragment in a filtered Fourier domain. It is shown that appropriate normalization and windowing techniques allow perfect recognition of touching letters within words. This occurs even when no a priori knowledge of letter location within the word is available, provided that suitable prototypes can be established. Multiple alphabet prototypes were then built and used to examine widely differing type fonts. Techniques to set acceptance thresholds were evaluated and the behavior of the resulting recognition system tabulated. A number of false triggers did occur in this case, and these are discussed. Recommendations for further improvements to the system are given. (Author)
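
The abstract does not give the details of the matching step, so the following is only a minimal sketch of the general idea, not the report's actual method. It assumes one-dimensional scanner vectors, a zero-mean, unit-energy normalization, and a crude low-pass filter that keeps the first `keep` DFT coefficients. All function names, the `keep` parameter, and the `threshold` value are hypothetical choices made for illustration.

```python
import cmath
import math


def normalize(window):
    """Remove the mean and scale to unit energy (an assumed normalization)."""
    mean = sum(window) / len(window)
    centered = [v - mean for v in window]
    energy = math.sqrt(sum(v * v for v in centered))
    return [v / energy for v in centered] if energy > 0 else centered


def filtered_dft(window, keep):
    """Return the first `keep` DFT coefficients (a crude low-pass filter)."""
    n = len(window)
    return [
        sum(window[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(keep)
    ]


def spectral_distance(a, b):
    """Euclidean distance between two vectors of complex coefficients."""
    return math.sqrt(sum(abs(x - y) ** 2 for x, y in zip(a, b)))


def scan(signal, prototypes, keep=4, threshold=0.1):
    """Slide every prototype along the signal; report windows under threshold.

    Returns a sorted list of (offset, label, distance) tuples. No prior
    knowledge of letter positions is used: every offset is tested.
    """
    hits = []
    for label, proto in prototypes.items():
        width = len(proto)
        proto_spec = filtered_dft(normalize(proto), keep)
        for offset in range(len(signal) - width + 1):
            window = signal[offset:offset + width]
            window_spec = filtered_dft(normalize(window), keep)
            dist = spectral_distance(window_spec, proto_spec)
            if dist <= threshold:
                hits.append((offset, label, dist))
    return sorted(hits)
```

In this sketch the acceptance threshold trades missed letters against false triggers: raising `threshold` accepts noisier matches but also admits windows that straddle two adjacent letters.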
