A Self-Training Learning Document Binarization Framework

Abstract

Document image binarization techniques have been studied for many years, and many practical binarization techniques have been developed and applied successfully in commercial document analysis systems. However, current state-of-the-art methods fail to produce good binarization results for many badly degraded document images. In this paper, we propose a self-training learning framework for document image binarization. Building on previously reported binarization methods, the proposed framework first divides document image pixels into three categories: foreground pixels, background pixels, and uncertain pixels. A classifier is then trained on the document image pixels in the foreground and background categories. Finally, the uncertain pixels are classified using the learned pixel classifier. Extensive experiments have been conducted on the dataset used in the recent Document Image Binarization Contest (DIBCO) 2009. Experimental results show that our proposed framework significantly improves the performance of reported document image binarization methods.
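
The three steps in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how pixels are assigned to the three categories, which features are used, or which classifier is trained. The sketch assumes dark text on a light background, a fixed global threshold with a margin to mark uncertain pixels, two features per pixel (intensity and 3x3 local mean), and a nearest-centroid classifier. The names `self_training_binarize`, `local_mean`, `threshold`, and `margin` are hypothetical.

```python
def local_mean(img, r, c):
    """Mean intensity of the 3x3 neighbourhood around (r, c), clipped at the borders."""
    rows, cols = len(img), len(img[0])
    vals = [img[i][j]
            for i in range(max(0, r - 1), min(rows, r + 2))
            for j in range(max(0, c - 1), min(cols, c + 2))]
    return sum(vals) / len(vals)


def self_training_binarize(img, threshold=128, margin=40):
    """Return a label map with 1 = foreground (text) and 0 = background.

    img is a list of rows of grayscale values in 0..255.
    threshold and margin are illustrative assumptions, not values from the paper.
    """
    FG, BG, UNCERTAIN = 1, 0, -1
    rows, cols = len(img), len(img[0])

    # Step 1: split pixels into foreground, background, and uncertain.
    # Assumption: pixels far from the threshold are trusted, the rest are uncertain.
    labels = [[UNCERTAIN] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if img[r][c] <= threshold - margin:
                labels[r][c] = FG
            elif img[r][c] >= threshold + margin:
                labels[r][c] = BG

    # Step 2: train a classifier on the trusted pixels of this same image.
    # Assumption: a nearest-centroid classifier over (intensity, local mean).
    samples = {FG: [], BG: []}
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != UNCERTAIN:
                samples[labels[r][c]].append((img[r][c], local_mean(img, r, c)))
    centroids = {}
    for cls, feats in samples.items():
        if feats:
            centroids[cls] = tuple(sum(f[k] for f in feats) / len(feats)
                                   for k in range(2))

    # Step 3: classify the uncertain pixels with the learned classifier.
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != UNCERTAIN:
                continue
            if len(centroids) < 2:
                # One class has no training pixels: fall back to the plain threshold.
                labels[r][c] = FG if img[r][c] < threshold else BG
                continue
            feat = (img[r][c], local_mean(img, r, c))
            labels[r][c] = min(
                centroids,
                key=lambda cls: sum((a - b) ** 2
                                    for a, b in zip(feat, centroids[cls])),
            )
    return labels


if __name__ == "__main__":
    # One-row toy image: two dark pixels, two ambiguous ones, two bright ones.
    print(self_training_binarize([[20, 30, 100, 150, 220, 230]]))
    # prints [[1, 1, 1, 0, 0, 0]]
```

Because the classifier is fitted to the confidently labeled pixels of the image being processed, its decision boundary adapts to that image's own contrast and degradation, which is the point of the self-training step. Any existing binarization method could stand in for the fixed threshold used in step 1 here.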

Bibliographic record

Source
2010 20th International Conference on Pattern Recognition | 2010 | pp. 3187-3190 | 4 pages
Authors
Bolan Su; Shijian Lu; Tan Chew Lim
Original format: PDF
Chinese Library Classification: Pattern recognition and devices
Keywords
document image binarization; image pixel classification; self-training learning framework

Similar documents

1. A learning framework for the optimization and automation of document binarization methods [J]. Mohamed Cheriet, Reza Farrahi Moghaddam, Rachid Hedjam. Computer Vision and Image Understanding, 2013, No. 3.
2. A multi-scale framework for adaptive binarization of degraded document images [J]. Moghaddam RF, Cheriet M. Pattern Recognition: The Journal of the Pattern Recognition Society, 2010, No. 6.
3. DeepOtsu: Document enhancement and binarization using iterative deep learning [J]. He Sheng, Schomaker Lambert. Pattern Recognition: The Journal of the Pattern Recognition Society, 2019.
4. A Self-Training Learning Document Binarization Framework [C]. Bolan Su, Shijian Lu, Tan Chew Lim. International Conference on Pattern Recognition, 2010.
5. Effective and efficient binarization of degraded document images [D]. Parker, Jon Ivan. 2016.
6. Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition [O]. Hubert Michalak, Krzysztof Okarma. 2020.
7. A Self-training Learning Document Binarization Framework [O]. Bolan Su, Shijian Lu, Chew Lim Tan. 2011.
