首页> 外文会议>Document recognition and retrieval XXI >Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios
【24h】

Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios

机译:通过Levenshtein距离进行正交滤波后的对数统治间隙比之间的形式相似性

获取原文
获取原文并翻译 | 示例

摘要

Geometric invariants are combined with edit distance to compare the ruling configuration of noisy filled-out forms. It is shown that gap-ratios used as features capture most of the ruling information of even low-resolution and poorly scanned form images, and that the edit distance is tolerant of missed and spurious rulings. No preprocessing is required and the potentially time-consuming string operations are performed on a sparse representation of the detected rulings. Based on edit distance, 158 Arabic forms are classified into 15 groups with 89% accuracy. Since the method was developed for an application that precludes public dissemination of the data, it is illustrated on public-domain death certificates.
机译:将几何不变量与编辑距离相结合,以比较嘈杂填写表格的统治配置。结果表明,使用间隙比率作为特征可以捕获即使是低分辨率和扫描效果较差的表单图像的大多数裁定信息,并且编辑距离可以容忍遗漏和伪造的裁定。不需要预处理,并且对检测到的规则的稀疏表示执行可能耗时的字符串操作。根据编辑距离,将158种阿拉伯语表格分为15组,准确度为89%。由于该方法是为禁止公开发布数据的应用程序开发的,因此在公共领域的死亡证书上对此进行了说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号