首页> 美国政府科技报告 >Automatic script identification from images using cluster-based templates

【24h】

Automatic script identification from images using cluster-based templates

机译：使用基于群集的模板从图像中自动识别脚本

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We have developed a technique for automatically identifying the script used to generate a document that is stored electronically in bit image form. Our approach differs from previous work in that the distinctions among scripts are discovered by an automatic learning procedure, without any handson analysis. We first develop a set of representative symbols (templates) for each script in our database (Cyrillic, Roman, etc.). We do this by identifying all textual symbols in a set of training documents, scaling each symbol to a fixed size, clustering similar symbols, pruning minor clusters, and finding each cluster's centroid. To identify a new document's script, we identify and scale a subset of symbols from the document and compare them to the templates for each script. We choose the script whose templates provide the best match. Our current system distinguishes among the Armenian, Burmese, Chinese, Cyrillic, Ethiopic, Greek, Hebrew, Japanese, Korean, Roman, and Thai scripts with over 90% accuracy.

著录项

作者
Hochberg, J. ; Kerns, L. ; Kelly, P. ; Thomas, T.;
展开▼
作者单位

展开▼
年度 1995
页码 1-20
总页数 20
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Image Scanners; Automation; Design; Image Converters; Image Processing; Pattern Recognition; Photocopying; Meetings;

机译：图像扫描仪;自动化;设计;图像转换器;图像处理;模式识别;复印;会议;
入库时间 2022-08-29 11:05:47

相似文献

外文文献
中文文献
专利

1. Automatic script identification from document images using cluster-based templates [J] . Hochberg J., Kelly P. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1997,第2期

机译：使用基于群集的模板从文档图像自动识别脚本
2. AUTOMATIC LINE-LEVEL SCRIPT IDENTIFICATION FROM HANDWRITTEN DOCUMENT IMAGES - A REGION-WISE CLASSIFICATION FRAMEWORK FOR INDIAN SUBCONTINENT [J] . Sk Md Obaidullah, Chayan Halder, K. C. Santosh, Malaysian Journal of Computer Science . 2018,第1期

机译：手写文档图像的自动行级脚本识别-印度次大陆的区域明智分类框架
3. Automatic Identification of Oriental and Other Scripts in Image Documents [J] . C. Y. SUEN, S. BERGLER, N. NOBILE, International Journal of Computer Processing of Oriental Languages . 2005,第2期

机译：自动识别图像文档中的东方文字和其他文字
4. Automatic script identification from images using cluster-based templates [C] . Hochberg, J., Kerns, . 1995

机译：使用基于群集的模板从图像自动识别脚本
5. Automatic detection of patient identification and positioning errors in radiotherapy treatment using 3D setup images. [D] . Jani, Shyam Shirish. 2015

机译：使用3D设置图像自动检测放射治疗中的患者识别和定位错误。
6. Automatic whole brain tract‐based analysis using predefined tracts in a diffusion spectrum imaging template and an accurate registration strategy [O] . Yu‐Jen Chen, Yu‐Chun Lo, Yung‐Chin Hsu, 2015

机译：使用扩散光谱成像模板中的预定义束和准确的配准策略自动进行基于全脑束的分析
7. Automatic Script Identification from Images Using Cluster-based Templates [O] . Judith Hochberg, Lila Kerns, Patrick Kelly, 1995

机译：使用基于群集的模板从图像中自动识别脚本

Automatic script identification from images using cluster-based templates

摘要

著录项

相似文献

相关主题

期刊订阅