首页> 外文会议>IAPR-TC-15 international workshop on graph-based representations in pattern recognition >Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents
【24h】

Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents

机译:加快历史手写文档中基于图的关键字发现

获取原文

摘要

The present paper is concerned with a graph-based system for Keyword Spotting (KWS) in historical documents. This particular system operates on segmented words that are in turn represented as graphs. The basic KWS process employs the cubic-time bipartite matching algorithm (BP). Yet, even though this graph matching procedure is relatively efficient, the computation time is a limiting factor for processing large volumes of historical manuscripts. In order to speed up our framework, we propose a novel fast rejection heuristic. This heuristic compares the node distribution of the query graph and the document graph in a polar coordinate system. This comparison can be accomplished in linear time. If the node distributions are similar enough, the BP matching is actually carried out (otherwise the document graph is rejected). In an experimental evaluation on two benchmark datasets we show that about 50% or more of the matchings can be omitted with this procedure while the KWS accuracy is not negatively affected.
机译:本文涉及历史文档中基于图的关键字发现(KWS)系统。该特定系统对分段词进行操作,这些分段词又表示为图形。基本的KWS过程使用立方时间二分匹配算法(BP)。但是,即使该图形匹配过程相对有效,但计算时间仍是处理大量历史手稿的限制因素。为了加快我们的框架,我们提出了一种新颖的快速拒绝启发式算法。该启发式方法比较了极坐标系中查询图和文档图的节点分布。可以在线性时间内完成此比较。如果节点分布足够相似,则实际执行BP匹配(否则拒绝文档图)。在对两个基准数据集的实验评估中,我们表明,使用此过程可以省略大约50%或更多的匹配,而不会对KWS准确性造成负面影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号