【24h】

Are Characters Objects?

机译:是角色对象吗?

获取原文

摘要

This paper presents a character recognition system that handles degraded manuscript documents like the ones discovered at the St. Catherineȁ9;s Monastery. In contrast to state-of-the-art OCR systems, no early decision (image binarization) needs to be performed. Thus, an object recognition methodology is adapted for the recognition of ancient manuscripts. The proposed system is based on local descriptors which are clustered in order to localize characters. Finally, a class probability histogram is assigned to each character present in an image which allows for the character classification. The system achieves an F0.5 score of 0.77 on real world data that contains 13.5% highly degraded characters.
机译:本文提出了一种字符识别系统,该系统可以处理手稿退化的文件,例如在圣凯瑟琳9修道院发现的手稿文件。与最新的OCR系统相比,无需执行任何早期决策(图像二值化)。因此,对象识别方法学适用于古代手稿的识别。所提出的系统基于局部描述符,这些局部描述符被聚类以定位字符。最后,将类别概率直方图分配给图像中存在的每个字符,以进行字符分类。该系统在包含13.5%高度退化字符的真实世界数据上的F0.5得分为0.77。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号