Mathematical expression recognition is one of the challenging problems in the field of handwritten recognition. Public datasets are often used to evaluate and compare different computer solutions for recognition problems in several domains of applications. However, existing public datasets for handwritten mathematical expressions and symbols are still scarce both in number and in variety. Such scarcity makes large scale assessment of the existing techniques a difficult task. This paper proposes a novel approach, based on expression matching, for generating ground-truthed exemplars of expressions (and, therefore, of symbols). Matching is formulated as a graph matching problem in which symbols of input instances of a manually labeled model expression are matched to the symbols in the model. Pairwise matching cost considers both local and global features of the expression. Experimental results show achievement of high accuracy for several types of expressions, written by different users.
展开▼