首页> 外文会议>International Conference on Document Analysis and Recognition >A Novel Procedure to Speed up the Transcription of Historical Handwritten Documents by Interleaving Keyword Spotting and user Validation
【24h】

A Novel Procedure to Speed up the Transcription of Historical Handwritten Documents by Interleaving Keyword Spotting and user Validation

机译:通过交织关键词发现和用户验证来加快历史手写文档转录的新过程

获取原文

摘要

We propose a novel procedure to speed-up the content transcription of handwritten documents in digital historical archives when a keyword spotting system is used for the purpose. Instead of performing the validation of the system outputs in a single step, as it is customary, the proposed methodology envisaged a multi-step validation process to be embedded into a human-in-the-loop approach. At each step, the system outputs are validated and, whenever an image word that does not correspond to any entry of the keyword list is mistakenly returned by the system, its correct transcription is entered and used to query the system in the next step. The performance of our approach has been experimentally evaluated in terms of the total time to achieve the complete transcription of a subset of documents from the Bentham dataset. The results confirm that interleaving keyword spotting by the system and validation by the user leads to a significant reduction of the time required to transcribe the document content with respect to both the manual transcription and the traditional end-of-the-loop validation process.
机译:我们提出了一种新颖的程序,当使用关键字查找系统时,可以加快数字历史档案中手写文档的内容转录。提议的方法不是像通常那样在单个步骤中执行系统输出的验证,而是设想将多步骤验证过程嵌入到“人在环法”中。在每个步骤中,都会验证系统输出,并且每当系统错误地返回了与关键字列表的任何条目都不对应的图像单词时,都会输入其正确的转录,并在下一步中用于查询系统。我们已根据总时间从边沁数据集中完成文档子集的完全转录进行了实验评估,我们的方法的性能得到了验证。结果证实,相对于手动转录和传统的循环验证过程而言,由系统进行的关键字关键词交织和用户的验证导致转录文档内容所需的时间显着减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号