首页> 外文会议> >Speeding-up Chinese character recognition in an automatic document reading system
【24h】

Speeding-up Chinese character recognition in an automatic document reading system

机译:在自动文件阅读系统中加速汉字识别

获取原文

摘要

We present two techniques for speeding up character recognition. Our character recognition system, including the candidate cluster selection and detail matching modules, is implemented using two statistical features: crossing counts and contour direction counts. In the training stage, we divide characters into different clusters. To keep a very high recognition rate, the candidate cluster selection module selects the top 60 clusters with minimal distances from among 300 predefined clusters. To further speed up the recognition speed, we use a modified branch and bound algorithm in the detail matching module. In the automatic document reading system, characters and punctuation marks are first extracted from printed document images and sorted according to their positions and the document orientation. The system then recognizes all printed Chinese characters between pairs of punctuation marks. The results are then spoken aloud by a speech synthesis system.
机译:我们提出了两种加速字符识别的技术。我们的字符识别系统(包括候选聚类选择和详细信息匹配模块)使用两种统计功能实现:交叉计数和轮廓方向计数。在训练阶段,我们将角色分为不同的类别。为了保持很高的识别率,候选聚类选择模块从300个预定义聚类中选择距离最小的前60个聚类。为了进一步加快识别速度,我们在详细信息匹配模块中使用了改进的分支定界算法。在自动文档读取系统中,首先从打印的文档图像中提取字符和标点符号,然后根据它们的位置和文档方向对其进行分类。然后,系统识别出成对的标点符号之间的所有印刷汉字。然后由语音合成系统大声说出结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号