...
【24h】

MACHINE-PRINTED JAPANESE DOCUMENT RECOGNITION

机译:机印日语文件识别

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Cherry Blossom is a general-purpose Japanese document recognition system developed at CEDAR. The input to the system can be facsimile pages or images scanned at low resolution. Given a Japanese document image, the system deskews the image, extracts text regions, segments text regions into text lines and further into characters, and recognizes character images as characters in JIS code. Two feature sets, the Local Stroke Direction feature and the Gradient, Structural, and Concavity feature, are used for character classification. Two classification methods, the nearest neighbor classifier and the minimum error subspace method, have been designed and they have been integrated to achieve better performance. We also describe the new Japanese character image database developed at CEDAR. This database consists of approximately 180,000 labeled character images from more than 3300 categories, extracted from diverse document images. Results of our system on this dataset are also presented. (C) 1997 Pattern Recognition Society. Published by Elsevier Science Ltd. [References: 28]
机译:樱花是由CEDAR开发的通用日语文件识别系统。系统的输入可以是传真页或以低分辨率扫描的图像。给定日语文档图像,系统对图像进行去歪斜,提取文本区域,将文本区域划分为文本行,然后进一步细分为字符,然后将字符图像识别为JIS代码中的字符。字符分类使用两个特征集,即“局部笔划方向”特征和“渐变”,“结构”和“凹度”特征。设计了两种分类方法,最近邻分类器和最小误差子空间法,并将它们集成在一起以获得更好的性能。我们还将介绍在CEDAR开发的新的日语字符图像数据库。该数据库包含从3300多个类别中提取的大约180,000个带标签的字符图像,这些图像是从各种文档图像中提取的。还介绍了我们在该数据集上的系统结果。 (C)1997模式识别学会。由Elsevier Science Ltd.发布[参考:28]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号