Matching based ground-truth annotation for online handwritten mathematical expressions

Hirata Nina S. T.; Julca-Aguilar Frank D.

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Matching based ground-truth annotation for online handwritten mathematical expressions

【24h】

Matching based ground-truth annotation for online handwritten mathematical expressions

机译：基于匹配的地面真相注释，用于在线手写数学表达式

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Assessment of mathematical expression recognition at expression level only is not sufficient to diagnose strengths and weaknesses of different recognition systems. In order to make assessment at different levels possible, large datasets annotated with ground-truth data at different levels, such as at symbol segmentation, symbol classification, symbol/sub-expression spatial relationships, baselines or whole expression levels, are needed. Creation of ground-truthed datasets of handwritten mathematical expressions is a challenging task due to the need to cope with a large variability of symbol classes, expression layouts, writing styles, among other issues including the fact that manual annotation is an error-prone procedure. We propose an expression matching approach where symbols in a transcribed expression are assigned to the corresponding symbols in the respective model expression. Matching is formulated as a simple linear assignment problem where matching cost is defined as a weighted linear combination of local (symbol) and global (structural) characteristics. Once a symbol-to-symbol assignment is computed, not only symbol labels but all other ground-truth data attached to the model expression can be automatically transferred to the transcribed expression. We use two independent large test sets to empirically evaluate the influence of the cost function terms on matching performance. Results show mean symbol assignment rates above 99% on both sets, suggesting the potential of the method as an useful tool for helping the creation of ground-truthed online mathematical expression datasets. (C) 2014 Elsevier Ltd. All rights reserved.

机译：仅在表达水平上评估数学表达识别能力不足以诊断不同识别系统的优缺点。为了使在不同级别的评估成为可能，需要在不同级别（例如在符号分割，符号分类，符号/子表达空间关系，基线或整个表达级别）上标注真实数据的大型数据集。由于需要应对符号类，表达式布局，书写样式等多种变化，因此创建手写数学表达式的地面数据集是一项艰巨的任务，其中包括手动注释是易于出错的过程。我们提出一种表达式匹配方法，其中将转录表达式中的符号分配给各个模型表达式中的相应符号。匹配被描述为一个简单的线性分配问题，其中匹配成本定义为局部（符号）特征和全局（结构）特征的加权线性组合。一旦计算了符号到符号的分配，不仅符号标签，而且附加到模型表达式的所有其他实际数据也可以自动传输到转录的表达式。我们使用两个独立的大型测试集来凭经验评估成本函数项对匹配性能的影响。结果表明，两组的平均符号分配率均高于99％，这表明该方法具有潜力，可用于帮助创建真实的在线数学表达式数据集。（C）2014 Elsevier Ltd.保留所有权利。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2015年第3期|共12页
作者
Hirata Nina S. T.; Julca-Aguilar Frank D.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Structural pattern analysis; Shape information; Handwriting recognition; Linear assignment problem; Ground-truth annotation; Mathematical expression dataset;

机译：结构模式分析;形状信息;笔迹识别;线性分配问题;真实性注释;数学表达式数据集;

相似文献

外文文献
中文文献
专利

1. Matching based ground-truth annotation for online handwritten mathematical expressions [J] . Hirata Nina S. T., Julca-Aguilar Frank D. Pattern Recognition: The Journal of the Pattern Recognition Society . 2015,第3期

机译：基于匹配的地面真相注释，用于在线手写数学表达式
2. A tree-BLSTM-based recognition system for online handwritten mathematical expressions [J] . Neural computing & applications . 2020,第9期

机译：基于树-BLSTM的在线手写数学表达式的识别系统
3. Clustering online handwritten mathematical expressions [J] . Huy Quang Ung, Cuong Tuan Nguyen, Khanh Minh Phan, Pattern recognition letters . 2021,第Juna期

机译：聚类在线手写数学表达式
4. MST-based Visual Parsing of Online Handwritten Mathematical Expressions [C] . Lei Hu, Richard Zanibbi International Conference on Frontiers in Handwriting Recognition . 2016

机译：基于MST的在线手写数学表达式的可视化解析
5. Features and Algorithms for Visual Parsing of Handwritten Mathematical Expressions. [D] . Hu, Lei. 2016

机译：可视化手写数学表达式的功能和算法。
6. Online Handwritten Signature Verification Using Neural Network Classifier Based on Principal Component Analysis [O] . Vahab Iranmanesh, Sharifah Mumtazah Syed Ahmad, Wan Azizun Wan Adnan, -1

机译：基于主成分分析的神经网络分类器在线手写签名验证
7. A GRU-based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition [O] . Zhang, Jianshu, Du, Jun, Dai, Lirong 2017

机译：一种基于GRU的编码器 - 解码器方法，注重在线手写数学表达识别

Matching based ground-truth annotation for online handwritten mathematical expressions

摘要

著录项

相似文献

相关主题

期刊订阅