首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Writing Type and Language Identification in Heterogeneous and Complex Documents
【24h】

Writing Type and Language Identification in Heterogeneous and Complex Documents

机译:异构和复杂文档中的写作类型和语言识别

获取原文

摘要

This paper presents a system dedicated to automatic recognition of both the writing type and the language of text regions in heterogeneous and complex documents. This system is able to process documents with mixed printed and handwritten text, in various languages (French, English and Arabic). To handle such a problem, we divided it into two sub-tasks: The writing type identification and the language identification. The method for the writing type recognition is based on the analysis of the connected components while the language identification approach combines the analysis of connected components and the analysis of character distributions. We present the results obtained by the system during the second competition round of the MAURDOR campaign, and show that the performance of our system compares favorably with other participants.
机译:本文提出了一种系统,用于自动识别异构​​和复杂文档中文本区域的书写类型和语言。该系统能够处理多种语言(法语,英语和阿拉伯语)的混合印刷和手写文本的文档。为了解决这个问题,我们将其分为两个子任务:写作类型识别和语言识别。书写类型识别的方法基于对连接组件的分析,而语言识别方法则将对连接组件的分析和字符分布的分析结合在一起。我们介绍了该系统在MAURDOR广告系列的第二轮竞赛中获得的结果,并表明我们的系统性能与其他参与者相比具有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号