【24h】

Tools for Developing OCRs for Indian Scripts

机译:为印度脚本开发OCR的工具

获取原文

摘要

Development of OCRs for Indian script is an active area of research today. Indian scripts present great challenges to an OCR designer due to the large number of letters in the alphabet, the sophisticated ways in which they combine, and the complicated graphemes they result in. The problem is compounded by the unstructured manner in which popular fonts are designed. There is a lot of common structure in the different Indian scripts. In this paper, we argue that a number of automatic and semi-automatic tools can ease the development of recognizers for new font styles and new scripts. We discuss briefly three such tools we developed and show how they have helped build new OCRs. An integrated approach to the design of OCRs for all Indian scripts has great benefits. We are building OCRs for many Indian languages following this approach as part of a system to provide tools to create content in them.
机译:印度剧本的OCRS开发是今天的一个活跃的研究领域。 由于字母表中的大量字母,印度脚本对OCR设计师带来了极大的挑战,它们结合的复杂方式以及它们导致的复杂的格式。通过设计流行字体的非结构化方式复合了问题 。 不同的印度剧本中有很多共同结构。 在本文中,我们争辩说,许多自动和半自动工具可以缓解新字体样式和新脚本的识别员的开发。 我们简要讨论三种这样的工具,我们开发并展示了他们如何帮助建立新的OCR。 所有印度脚本的OCR设计的综合方法具有很大的好处。 我们正在为许多印度语言构建OCRS,这是一个系统的一部分,以提供在它们中创建内容的工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号