首页> 外文学位 >InkLink: A writer-dependent on-line unconstrained handwriting recognition system.
【24h】

InkLink: A writer-dependent on-line unconstrained handwriting recognition system.

机译:InkLink:依赖于书写者的在线无限制手写识别系统。

获取原文
获取原文并翻译 | 示例

摘要

InkLink is a new recognition method for on-line handwriting. Detecting poly-gram matches between words at lexically predicted locations avoids the segmentation and vocabulary limitations of character-level and word-level recognition systems, respectively. At the signal level, ligatures facilitate matching longer segments because individual characters are often indistinct. The InkLink prototype word recognition system has access to a lexicon of plausible labels, and to a set of handwritten reference words represented by extremal-point and chain code features. Two additional feature representations are obtained by reordering and pruning the time-sorted extremal-point features. For each set of features, the average number of features per letter segment is obtained from the reference set by least squares estimation.; The system consists of three stages: lexical processing, signal matching and classification. The lexical stage pre-computes the match length and location of every word in the lexicon by matching its label to the label of every reference word. The unknown word is hypothesized to be each word in the lexicon. The length and location of feature-level matches are predicted by applying the character feature-length estimates to the lexical matches between the candidate word and reference set. At these predicated locations, the signal matching stage detects the longest observed feature-level match between the unknown and each reference word. The distribution of observed feature-level match lengths, conditioned on predicted match lengths, is estimated using localized left and right word alignments. The Bayesian classification stage assigns to the unknown word the label of the lexical candidate that maximizes the probability of the observed feature-level match lengths at the predicted locations. The rank orders based on the four feature sets are combined by Borda Count.; On a typical writer, with a lexicon of 1000 words and a reference set of 1000 words, the final accuracy is 84.1%. With a lexicon of 100 words and writer-specific reference sets of 500 words, the accuracy on a dozen new writers ranges from 48.3% to 97.3%. The higher accuracies are obtained on smooth, unslanted writing. Other possible applications include handwritten and printed text in other alphabetic scripts, and speech recognition based on phonetically labeled reference sets.
机译:InkLink是一种用于在线手写的新识别方法。在词汇预测位置检测单词之间的多义词匹配,分别避免了字符级和单词级识别系统的细分和词汇限制。在信号级别,连字有助于匹配更长的段,因为单个字符通常不清楚。 InkLink原型单词识别系统可以访问似是而非的标签词典,并可以访问由极点和链码功能表示的一组手写参考单词。通过对时间排序的极点特征进行重新排序和修剪,可以获得两个附加特征表示。对于每组特征,通过最小二乘估计从参考集中获得每个字母段的平均特征数。该系统包括三个阶段:词法处理,信号匹配和分类。词汇阶段通过将单词的标签与每个参考单词的标签进行匹配来预先计算单词中每个单词的匹配长度和位置。假设未知词是词典中的每个词。通过将字符特征长度估计应用于候选单词和参考集之间的词汇匹配,可以预测特征级别匹配的长度和位置。在这些预定位置,信号匹配阶段检测到未知和每个参考词之间最长的观察到的特征级匹配。使用预测的匹配长度为条件的观察到的特征级别匹配长度的分布,使用局部左对齐和右对齐来估计。贝叶斯分类阶段为未知单词分配词汇候选的标签,该标签使在预测位置处观察到的特征级匹配长度的概率最大化。基于四个功能集的排名由Borda Count合并。在具有1000个单词的词典和1000个单词的参考集的典型作家中,最终准确性为84.1%。使用100个单词的词典和500个单词的特定于作者的参考集,一打新作者的准确率在48.3%至97.3%之间。较高的准确度是通过流畅,倾斜的书写获得的。其他可能的应用程序包括其他字母脚本中的手写和打印文本,以及基于语音标记的参考集的语音识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号