首页> 外文会议> >Experimental comparisons of binarization and multi-thresholding methods on document images
【24h】

Experimental comparisons of binarization and multi-thresholding methods on document images

机译:文档图像二值化和多阈值方法的实验比较

获取原文

摘要

Thresholding methods are applied here to document images and their experimental results compared. In one set of tests, different thresholding methods are used to binarize document images, then optical character recognition (OCR) is performed on the resulting text and the recognition results are compared. In the other set of tests, multi-thresholding is performed on document images-to obtain three or more levels for images with more than binary levels-and the results are compared. Four thresholding methods are compared in the experiments: a discriminant analysis method, a maximum entropy method, a moment-preserving method, and a connectivity-preserving method. A method using a minimum-error criterion is also commented upon. The moment-preserving and connectivity-preserving methods are found to yield the best OCR results from the binarized images, and the connectivity-preserving method yields the fewest binarization and multi-thresholding failures.
机译:阈值方法在这里应用于文档图像,并比较它们的实验结果。在一组测试中,使用不同的阈值化方法对文档图像进行二值化,然后对所得文本执行光学字符识别(OCR),然后比较识别结果。在另一组测试中,对文档图像执行多阈值处理-为具有大于二进制级别的图像获得三个或更多级别-并对结果进行比较。实验中比较了四种阈值处理方法:判别分析方法,最大熵方法,力矩保留方法和连通性保留方法。还评论了一种使用最小误差准则的方法。发现保持矩和保持连通性的方法从二值化图像中产生最佳的OCR结果,而保持连通性的方法产生的二值化和多阈值故障最少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号