首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation
【24h】

Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation

机译:用于手写数学公式表示和符号分割的视线描边图和Parzen形状上下文特征

获取原文

摘要

This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.
机译:本文提出了一种手写数学公式的新表示形式:使用笔划凸包计算的笔划的视线(LOS)图。使用CROHME 2012和2014数据集的实验结果表明,与时间序列图,最小生成树图和k最近邻图等常用图相比,LOS图更好地捕获了手写公式的视觉结构。然后,我们引入基于形状上下文的特征(Parzen窗口形状上下文(PSC)),该特征与简单的几何特征以及笔画之间的时间间隔相结合,以获得最新的符号分割结果(92.43%F-measure适用于CROHME 2014)。使用简单的方法即可获得此结果,而无需使用OCR或表达式语法。二进制随机森林分类器可识别哪些LOS图形边缘代表应合并为符号的笔画对,并在合并的笔画上定义相连的笔画。视线图和Parzen形状上下文很好地表示了视觉结构,并且可能会有用地应用于其他符号。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号