首页> 外文会议> >Influence of language models and candidate set size on contextual post-processing for Chinese script recognition

【24h】

Influence of language models and candidate set size on contextual post-processing for Chinese script recognition

机译：语言模型和候选集大小对中文脚本识别的上下文后处理的影响

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the Chinese language, a word consisting of one or more characters is a basic syntax-meaningful unit, however, each character in the word also has a definite meaning in itself. We compare the perplexities of four n-gram language models (character-based bigram, character-based trigram, word-based bigram and class-based bigram) and their influence on the performance of contextual post-processing of Chinese scripts in an offline handwritten Chinese character recognition system. We also demonstrate the influence of the candidate set size on the performance of contextual post-processing in detail, and indicate that the number of candidates should vary with each script.

机译：在中文中，由一个或多个字符组成的单词是基本的有意义的语法单元，但是单词中的每个字符本身也具有确定的含义。我们比较了四种n语法语言模型（基于字符的双字母组，基于字符的三字母组，基于单词的双字母组和基于类的双字母组）的困惑及其对离线手写汉字的上下文后处理性能的影响。汉字识别系统。我们还将详细演示候选集大小对上下文后处理性能的影响，并指出候选数量应随每个脚本而变化。

著录项

来源
《》|2004年|p.537-540|共4页
会议地点
作者
Yuan-Xiang Li; Chew Lim Tan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
natural languages; text analysis; word processing; handwritten character recognition; language models; candidate set size; contextual post-processing; Chinese script recognition; basic syntax-meaningful unit; character-based bigram; character-based trigram; word-based bigram; class-based bigram; offline handwritten Chinese character recognition system;

机译：自然语言;文本分析;文字处理;手写字符识别;语言模型;候选集大小;上下文后处理;汉字识别;基本句法有意义的单元;基于字符的双字母组;基于字符的三字母组;基于单词的双字母组;基于类的二元体;离线手写汉字识别系统;

相似文献

外文文献
中文文献
专利

1. Contextual post-processing based on the confusion matrix in offline handwritten Chinese script recognition [J] . Li YX, Tan CL, Ding XQ, Pattern Recognition: The Journal of the Pattern Recognition Society . 2004,第9期

机译：离线手写汉字识别中基于混淆矩阵的上下文后处理
2. A HYBRID POST-PROCESSING SYSTEM FOR OFFLINE HANDWRITTEN CHINESE CHARACTER RECOGNITION BASED ON A STATISTICAL LANGUAGE MODEL [J] . RUIFENG XU, DANIEL S. YEUNG, DAMING SHI International Journal of Pattern Recognition and Artificial Intelligence . 2005,第3期

机译：基于统计语言模型的离线手写汉字识别混合后处理系统
3. A hybrid post-processing system for offline handwritten Chinese script recognition [J] . Yuan-Xiang Li, Chew Lim Tan, Xiaoqing Ding Pattern Analysis and Applications . 2005,第3期

机译：用于离线手写汉字识别的混合后处理系统
4. Influence of language models and candidate set size on contextual post-processing for Chinese script recognition [C] . Yuan-Xiang Li, Chew Lim Tan International Conference on Pattern Recognition . 2004

机译：语言模型和候选集规范对汉字识别上下文后处理的影响
5. (Re)Writing the Script of Second Language Teaching/Learning: Exploring Teacher Candidates' Conceptual Understanding of Drama-based Instruction [D] . Vetere, Timothy Matthew 2018

机译：（重新）撰写第二语言教学/学习的脚本：探索教师候选人对基于戏剧教学的概念理解
6. Novel Deep Convolutional Neural Network-Based Contextual Recognition of Arabic Handwritten Scripts [O] . Rami Ahmed, Mandar Gogate, Ahsen Tahir, 2021

机译：基于新型卷积神经网络的阿拉伯语手写脚本的新型卷积神经网络
7. Influence of Language Models and Candidate Set Size on Contextual Post-processing for Chinese Script Recognition [O] . Yuan-xiang Li, Chew Lim Tan 2004

机译：语言模型和候选集大小对中文脚本识别的上下文后处理的影响

Influence of language models and candidate set size on contextual post-processing for Chinese script recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅