首页> 外国专利> SYSTEM AND METHOD FOR AUTOMATIC SUBCHARACTER UNIT AND LEXICON GENERATION FOR HANDWRITING RECOGNITION

SYSTEM AND METHOD FOR AUTOMATIC SUBCHARACTER UNIT AND LEXICON GENERATION FOR HANDWRITING RECOGNITION

机译:手写识别自动子字符单元和生成词法的系统和方法

摘要

A system for automatic subcharacter unit and lexicon generation for handwriting recognition comprises a processing unit, a handwriting input device, and a memory wherein a segmentation unit, a subcharacter generation unit, a lexicon unit, and a modeling unit reside. The segmentation unit generates feature vectors corresponding to sample characters. The subcharacter generation unit clusters feature vectors and assigns each feature vector associated with a given cluster an identical label. The lexicon unit constructs a lexical graph for each character in a character set. The modeling unit generates a Hidden Markov Model for each set of identically-labeled feature vectors. After a first set of lexical graphs and Hidden Markov Models have been created, the subcharacter generation unit determines for each feature vector which Hidden Markov Model produces a highest likelihood value. The subcharacter generation unit relabels each feature vector according to the highest likelihood value, after which the lexicon unit and the modeling unit generate a new set of lexical graphs and a new set of Hidden Markov Models, respectively. The feature vector relabeling, lexicon generation, and Hidden Markov Model generation are performed iteratively until a convergence criterion is met. The final set of Hidden Markov Model model parameters provide a set of subcharacter units for handwriting recognition, where the subcharacter units are derived from information inherent in the sample characters themselves.
机译:用于用于手写识别的自动子字符单元和词典生成的系统包括处理单元,手写输入设备和存储器,其中驻留有分割单元,子字符生成单元,词典单元和建模单元。分割单元生成与样本字符相对应的特征向量。子字符生成单元对特征向量进行聚类,并为与给定聚类关联的每个特征向量分配相同的标签。词典单元为字符集中的每个字符构造一个词图。建模单元为每组相同标记的特征向量生成一个隐马尔可夫模型。在创建了第一组词汇图和隐马尔可夫模型之后,子字符生成单元为每个特征向量确定哪个隐马尔可夫模型产生最高似然值。子字符生成单元根据最高似然值重新标记每个特征向量,然后,词典单元和建模单元分别生成一组新的词汇图和一组新的隐马尔可夫模型。迭代执行特征向量重标记,词典生成和隐马尔可夫模型生成,直到满足收敛准则为止。最终的“隐马尔可夫模型”模型参数集提供了一组用于手写识别的子字符单元,其中这些子字符单元是从样本字符本身固有的信息中得出的。

著录项

  • 公开/公告号WO9608787A1

    专利类型

  • 公开/公告日1996-03-21

    原文格式PDF

  • 申请/专利权人 APPLE COMPUTER INC.;

    申请/专利号WO1995US11815

  • 发明设计人 CHOW YEN-LU;LEE KAI-FU;GRAJSKI KAMIL;

    申请日1995-09-14

  • 分类号G06K9/68;G06K9/22;

  • 国家 WO

  • 入库时间 2022-08-22 03:49:01

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号