Machine recognition of mathematical expressions on printed documents is not trivial even when all the individual characters and symbols in an expression can be recognized correctly. In this paper, an automatic classification method of spatial relationships between the adjacent symbols in a pair is presented. This classification is important to realize an accurate structure an tlysis module of math OCR. Experimental results on very large databases showed that this ciassilication worked well with an accuracy of 99.525 by using distribution maps which are defined by two geometrie features, relative size and relative position, with eareful treatment on document-dependents haracteristics.
展开▼