首页> 外文会议> >Modelling polyfont printed characters with HMMs and a shift invariant Hamming distance

【24h】

Modelling polyfont printed characters with HMMs and a shift invariant Hamming distance

机译：使用HMM和平移不变的汉明距离对Polyfont印刷字符进行建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Rumours of the death of the problem of machine-printed text recognition have been greatly exaggerated. Reported results can be good enough to lead one to believe that this is a "solved problem". Closer analysis reveals test data that is often limited in its range of fonts and point sizes. Worse still, results are commonly quoted for noise-free images, ignoring the problems of recognising "real" documents such as faxes. Various methods have been proposed for modelling characters with Hidden Markov Models. The authors, amongst others, have suggested representing a character by analysing the pixel pattern in columns of its image, and linking sequential column patterns together with a HMM. In this paper we propose a method of quantising the patterns by means of a Shift Invariant Hamming Distance. A full experimental evaluation (45 fonts, 5 point sizes) in typical noise results in a recognition accuracy of 99% in the top-3 choices, and 94% top-choice for the best font. The method has a significant advantage in recognising noisy word images, due to classification being achieved without a prior segmentation of the word into characters.

机译：关于机印文本识别问题死亡的谣言被大大夸大了。报告的结果可能足以使人们相信这是一个“已解决的问题”。进一步的分析揭示了测试数据，这些数据通常在其字体和磅值范围内受到限制。更糟糕的是，通常会引用无噪声图像的结果，而忽略了识别“真实”文档（例如传真）的问题。已经提出了各种方法来用隐马尔可夫模型对字符进行建模。除其他外，作者建议通过分析图像列中的像素图案，并将顺序的列图案与HMM链接在一起来表示字符。在本文中，我们提出了一种通过移位不变汉明距离对模式进行量化的方法。对典型噪声进行全面的实验评估（45种字体，5点大小）后，前三项选择的识别精度为99％，最佳字体的选择精度为94％。该方法在识别嘈杂的单词图像方面具有显着的优势，这是因为无需事先将单词分割为字符即可实现分类。

著录项

来源
《》|1995年|P.504-507|共4页
会议地点
作者
Elms; A.J.; Illingworth; J.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Combination of HMMs for the representation of printed characters in noisy document images [J] . A J Elms, J Illingworth Image and Vision Computing . 1995,第5期

机译：HMM的组合，用于在嘈杂的文档图像中表示打印字符
2. Printed Arabic Character Recognition Using HMM [J] . Abbas H. Hassin, Xiang-Long Tang, Jia-Feng Liu, Journal of Computer Science & Technology . 2004,第4期

机译：使用HMM的印刷阿拉伯字符识别
3. Printed Arabic Character Recognition Using HMM [J] . Abbas H.Hassin, Xiang-Long Tang, Jia-Feng Liu, 计算机科学技术学报（英文版） . 2004,第004期

机译：使用HMM的印刷阿拉伯字符识别
4. Modelling polyfont printed characters with HMMs and a shift invariant Hamming distance [C] . Elms A.J., Illingworth J., Institute of Electric and Electronic Engineer International Conference on Document Analysis and Recognition . 1995

机译：使用HMMS建模Polyfont打印字符和换档不变汉明距离
5. Statistics of nonlinear averaging spectral estimators and a novel distance measure for HMMs with application to speech quality estimation. [D] . Liang, Hongkang. 2005

机译：非线性平均频谱估计器的统计数据和HMM的新型距离测度，并应用于语音质量估计。
6. Shifted Hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping [O] . Hongyi Xin, John Greth, John Emmons, -1

机译：汉明距离偏移：快速准确的SIMD友好型过滤器可加快读取映射中的比对验证
7. A Comparative Study between the Pseudo Zernike and Krawtchouk Invariants Moments for Printed Arabic Characters Recognition [O] . Rachid Salouan, Informatic Polydisciplinary, Sultan Moulay Slimane, 2015

机译：印刷阿拉伯字符识别的伪Zernike和Krawtchouk不变矩的比较研究

Modelling polyfont printed characters with HMMs and a shift invariant Hamming distance

摘要

著录项

相似文献

相关主题

期刊订阅