首页>
外国专利>
SYSTEM AND METHOD FOR AUTOMATIC SUBCHARACTER UNIT AND LEXICON GENERATION FOR HANDWRITING RECOGNITION
SYSTEM AND METHOD FOR AUTOMATIC SUBCHARACTER UNIT AND LEXICON GENERATION FOR HANDWRITING RECOGNITION
展开▼
机译:手写识别自动子字符单元和生成词法的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system for automatic subcharacter unit and lexicon generation for handwriting recognition comprises a processing unit, a handwriting input device, and a memory wherein a segmentation unit, a subcharacter generation unit, a lexicon unit, and a modeling unit reside. The segmentation unit generates feature vectors corresponding to sample characters. The subcharacter generation unit clusters feature vectors and assigns each feature vector associated with a given cluster an identical label. The lexicon unit constructs a lexical graph for each character in a character set. The modeling unit generates a Hidden Markov Model for each set of identically-labeled feature vectors. After a first set of lexical graphs and Hidden Markov Models have been created, the subcharacter generation unit determines for each feature vector which Hidden Markov Model produces a highest likelihood value. The subcharacter generation unit relabels each feature vector according to the highest likelihood value, after which the lexicon unit and the modeling unit generate a new set of lexical graphs and a new set of Hidden Markov Models, respectively. The feature vector relabeling, lexicon generation, and Hidden Markov Model generation are performed iteratively until a convergence criterion is met. The final set of Hidden Markov Model model parameters provide a set of subcharacter units for handwriting recognition, where the subcharacter units are derived from information inherent in the sample characters themselves.
展开▼