首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >KEYWORD SPOTTING FROM ONLINE CHINESE HANDWRITTEN DOCUMENTS USING ONE-VERSUS-ALL CHARACTER CLASSIFICATION MODEL
【24h】

KEYWORD SPOTTING FROM ONLINE CHINESE HANDWRITTEN DOCUMENTS USING ONE-VERSUS-ALL CHARACTER CLASSIFICATION MODEL

机译:使用一对多特征分类模型从中文手写文档中发现关键词

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a method for text-query-based keyword spotting from online Chinese handwritten documents using character classification model. The similarity between the query word and handwriting is obtained by combining the character classification scores. The classifier is trained by one-versus-all strategy so that it gives high similarity to the target class and low scores to the others. Using character classification-based word similarity also helps overcome the out-of-vocabulary (OOV) problem. We use a character-synchronous dynamic search algorithm to efficiently spot the query word in large database. The retrieval performance is further improved by using competing character confusion and writer-adaptive thresholds. Our experimental results on a large handwriting database CASIA-OLHWDB justify the superiority of one-versus-all trained classifiers and the benefits of confidence transformation, character confusion and adaptive thresholds. Particularly, a one-versus-all trained prototype classifier performs as well as a linear support vector machine (SVM) classifier, but consumes much less storage of index file. The experimental comparison with keyword spotting based on handwritten text recognition also demonstrates the effectiveness of the proposed method.
机译:本文提出了一种基于字符分类模型的在线中文手写文档中基于文本查询的关键词识别方法。通过组合字符分类得分,可以获得查询词和笔迹之间的相似度。该分类器通过一对多策略进行训练,因此它与目标类具有很高的相似性,而其他类则得分较低。使用基于字符分类的单词相似性还有助于克服词汇不足(OOV)问题。我们使用字符同步动态搜索算法来有效地发现大型数据库中的查询词。通过使用竞争性字符混淆和作家自适应阈值,可以进一步提高检索性能。我们在大型手写数据库CASIA-OLHWDB上的实验结果证明了所有训练有素的分类器的优越性以及置信度转换,字符混淆和自适应阈值的好处。特别是,一个经过训练的原型分类器的性能与线性支持向量机(SVM)分类器一样好,但是消耗的索引文件存储量却少得多。与基于手写文本识别的关键词识别的实验比较也证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号