xtracting information from images of documents is easier when the image is crisp, clear and devoid of noise. Consequently, an algorithm that reliably removes noise from imperfect document images and generates better images could clean input to other image processing algorithms thereby improving their outputs and/or enabling simpler techniques. The importance of this task is evident given the rate at which scanners, copier, and smart phones are producing document images.;This dissertation makes three contributions to this problem area. The first contribution is an unsupervised method for converting a document image to a strictly white and black image (i.e., cleaning a document image). This initial contribution is the result of examining the hypothesis that acceptable binarization parameters can be found with an automatic parameter search and was patented in US Patent
展开▼