首页> 外文会议>Document recognition and retrieval XX >Evaluation of lexicon size variations on a verification and rejection system based on SVM,for accurate and robust recognition of handwritten words
【24h】

Evaluation of lexicon size variations on a verification and rejection system based on SVM,for accurate and robust recognition of handwritten words

机译:在基于SVM的验证和拒绝系统上评估词典大小变化,以准确,可靠地识别手写单词

获取原文
获取原文并翻译 | 示例

摘要

The transcription of handwritten words remains a still challenging and difficult task. When processing full pages, approaches are limited by the trade-off between automatic recognition errors and the tedious aspect of human user verification. In this article, we present our investigations to improve the capabilities of an automatic recognizer, so as to be able to reject unknown words (not to take wrong decisions) while correctly rejecting (i.e. to recognize as much as possible from the lexicon of known words). This is the active research topic of developing a verification system that optimize the trade-off between performance and reliability. To minimize the recognition errors, a verification system is usually used to accept or reject the hypotheses produced by an existing recognition system. Thus, we re-use our novel verification architecture1 here: the recognition hypotheses are re-scored by a set of support vector machines, and validated by a verification mechanism based on multiple rejection thresholds. In order to tune these (class-dependent) rejection thresholds, an algorithm based on dynamic programming has been proposed which focus on maximizing the recognition rate for a given error rate. Experiments have been carried out on the RIMES database in three steps. The first two showed that this approach results in a performance superior or equal to other state-of-the-art rejection methods. We focus here on the third one showing that this verification system also greatly improves results of keywords extraction in a set of handwritten words, with a strong robustness to lexicon size variations (21 lexicons have been tested from 167 entries up to 5,600 entries) which is particularly relevant to our application context cooperating with humans, and only made possible thanks to the rejection ability of this proposed system. The proposed verification system, compared to a HMM with simple rejection, improves on average the recognition rate by 57% (resp. 33% and 21%) for a given error rate of 1% (resp. 5% and 10%).
机译:手写单词的转录仍然是一项艰巨而艰巨的任务。在处理整页时,方法受到自动识别错误与人工验证的繁琐方面之间的权衡的限制。在本文中,我们提出了一些研究,以提高自动识别器的功能,以便能够拒绝未知单词(不会做出错误的决定),同时能够正确拒绝(即从已知单词的词典中尽可能多地识别出来) )。这是开发验证系统的积极研究主题,该验证系统可优化性能和可靠性之间的权衡。为了使识别错误最小化,通常使用验证系统来接受或拒绝现有识别系统产生的假设。因此,我们在这里重用了新颖的验证架构:识别假说由一组支持向量机重新评分,并由基于多个拒绝阈值的验证机制进行验证。为了调整这些(与类有关的)拒绝阈值,已经提出了一种基于动态编程的算法,该算法专注于在给定的错误率下最大化识别率。分三个步骤在RIMES数据库上进行了实验。前两个结果表明,此方法所产生的性能优于或等于其他最新的拒绝方法。我们将重点放在第三个上,这表明该验证系统还极大地改善了一组手写单词中关键字提取的结果,对词典大小变化具有很强的鲁棒性(已经测试了21个词典,从167个条目到5,600个条目)。与我们与人类合作的应用环境特别相关,并且仅由于此提议系统的拒绝能力才使之成为可能。与具有简单拒绝功能的HMM相比,提出的验证系统在给定的错误率为1%(分别为5%和10%)的情况下,平均识别率提高了57%(分别为33%和21%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号