首页> 外文会议>IAPR International Workshop on Document Analysis Systems >Entity Local Structure Graph Matching for Mislabeling Correction
【24h】

Entity Local Structure Graph Matching for Mislabeling Correction

机译:实体局部结构图匹配误标标校正

获取原文

摘要

This paper proposes an entity local structure comparison approach based on inexact subgraph matching. The comparison results are used for mislabeling correction in the local structure. The latter represents a set of entity attribute labels which are physically close in a document image. It is modeled by an attributed graph describing the content and presentation features of the labels by the nodes and the geometrical features by the arcs. A local structure graph is matched with a structure model which represents a set of local structure model graphs. The structure model is initially built using a set of well chosen local structures based on a graph clustering algorithm and is then incrementally updated. The subgraph matching adopts a specific cost function that integrates the feature dissimilarities. The matched model graph is used to extract the missed labels, prune the extraneous ones and correct the erroneous label fields in the local structure. The evaluation of the structure comparison approach on 525 local structures extracted from 200 business documents achieves about 90% for recall and 95% for precision. The mislabeling correction rates in these local structures vary between 73% and 100%.
机译:本文提出了一种基于不精确子图匹配的实体局部结构比较方法。比较结果用于局部结构中的错误标记校正。后者表示一组实体属性标签,其在物理上靠近文档图像。它是由描述节点的标签的内容和呈现特征的归属图来建模,并且由弧通过弧形的几何特征。本地结构图与表示一组局部结构模型图的结构模型匹配。最初使用基于图形聚类算法的一组良好选择的本地结构构建结构模型,然后逐步更新。子图匹配采用特定的成本函数,该功能集成了特征异化。匹配的模型图用于提取未错过的标签,修剪外来的标签,并纠正本地结构中的错误标签字段。从200个商业文件中提取的525个局部结构的结构比较方法的评估达到了约90%的召回和95%以获得精度。这些局部结构中的误标表矫正率在73%和100%之间变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号