
Document-specific character template estimation


Abstract

An approach to supervised training of document-specific character templates from sample page images and unaligned transcriptions is presented. The template estimation problem is formulated as one of constrained maximum likelihood parameter estimation within the document image decoding (DID) framework. This leads to a two-phase iterative training algorithm that alternates between transcription alignment and aligned template estimation (ATE) steps. The ATE step is the heart of the algorithm and involves assigning template pixel colors to maximize likelihood while satisfying a template disjointness constraint. The training algorithm is demonstrated on a variety of English documents, including newspaper columns, 15th century books, degraded images of 19th century newspapers, and connected text in a script-like font. Three applications enabled by the training procedure are described: high-accuracy document-specific decoding, transcription error visualization, and printer font generation.
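The ATE step can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes a binary page image (1 = black), known glyph origins from a prior alignment step, a fixed template canvas size, and an asymmetric bit-flip noise model. The function name `estimate_templates` and the parameters `a0` and `a1` are hypothetical. Pixels are assigned greedily by log-likelihood gain, and each claimed image pixel is cleared so that later template pixels get no credit for it, which is one simple way to discourage overlapping templates.

```python
import math
import numpy as np

def estimate_templates(page, origins, height, width, a0=0.9, a1=0.9):
    """Greedy aligned template estimation on a binary page (1 = black).

    origins maps each character label to a list of (row, col) glyph origins,
    taken here as the top-left corner of a height x width template canvas.
    a0 / a1 are the assumed probabilities that a background / foreground
    template pixel is observed correctly (hypothetical noise parameters).
    Every canvas is assumed to lie fully inside the page.
    """
    gamma = math.log(a1 * a0 / ((1 - a1) * (1 - a0)))  # gain per matched black pixel
    beta = math.log((1 - a1) / a0)                      # cost per aligned instance
    work = page.astype(np.int64).copy()                 # black pixels not yet claimed
    templates = {c: np.zeros((height, width), dtype=np.uint8) for c in origins}

    while True:
        best_score, best_char, best_pos = 0.0, None, None
        for c, locs in origins.items():
            if not locs:
                continue
            # Count unclaimed black pixels under each canvas offset.
            counts = sum(work[r:r + height, col:col + width] for r, col in locs)
            score = gamma * counts + beta * len(locs)
            score[templates[c] == 1] = -np.inf          # skip assigned pixels
            pos = np.unravel_index(np.argmax(score), score.shape)
            if score[pos] > best_score:
                best_score, best_char, best_pos = score[pos], c, pos
        if best_char is None:
            break                                       # no assignment raises likelihood
        templates[best_char][best_pos] = 1
        dr, dc = best_pos
        for r, col in origins[best_char]:
            work[r + dr, col + dc] = 0                  # claim the covered image pixels
    return templates
```

In a full training loop under these assumptions, this function would alternate with an alignment step that re-estimates `origins` from the current templates. The alignment step is not sketched here.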


Bibliographic record

Source: Document Recognition III, 1996, pp. 14-26 (13 pages)
Authors: Gary E. Kopec (Xerox Palo Alto Research Ctr., Palo Alto, CA, USA); Mauricio Lomelin (Microsoft Corp., Palo Alto, CA, USA)
Original format: PDF
Chinese Library Classification: automation technology, computer technology

