首页> 外文会议>Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on >ExpressMatch: A System for Creating Ground-Truthed Datasets of Online Mathematical Expressions
【24h】

ExpressMatch: A System for Creating Ground-Truthed Datasets of Online Mathematical Expressions

机译:ExpressMatch:一种用于创建在线数学表达式的真实数据集的系统

获取原文
获取原文并翻译 | 示例

摘要

In recognition domains, publicly available ground-truthed datasets are essential to perform effective performance evaluation and comparison of existing methods and systems. However, in the field of online handwritten mathematical expression recognition, datasets are quite scarce and their creation is one of the current challenging issues. In this paper, we present Express Match, a system designed to help creation and management of online mathematical expression datasets with ground-truth data. In this system, handwritten model expressions can be input and manually annotated with ground-truth data, transcriptions of these expressions can be automatically annotated by matching them to the respective models. Additional metadata can also be attached to each sample expression. To test the system, a dataset consisting of 56 model expressions and 910 sample expressions with a total of 20,010 symbols, written by 25 different writers, has been created. This dataset, as well as Express Match, will be made publicly available.
机译:在识别领域,公开可用的真实数据集对于执行有效的性能评估以及现有方法和系统的比较至关重要。但是,在在线手写数学表达式识别领域,数据集非常稀缺,其创建是当前具有挑战性的问题之一。在本文中,我们介绍了Express Match,该系统旨在帮助创建和管理具有真实数据的在线数学表达式数据集。在此系统中,可以输入手写的模型表达式,并使用真实数据手动对其进行注释,然后通过将它们与各自的模型进行匹配,可以自动对这些表达式的转录进行注释。还可以将其他元数据附加到每个样本表达式。为了测试系统,已经创建了一个由56个模型表达式和910个样本表达式组成的数据集,由25个不同的编写者编写,总共具有20,010个符号。该数据集以及“快速匹配”将公开提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号