...
首页> 外文期刊>International Journal on Document Analysis and Recognition (IJDAR) >Handwritten text separation from annotated machine printed documents using Markov Random Fields
【24h】

Handwritten text separation from annotated machine printed documents using Markov Random Fields

机译:使用马尔可夫随机域从带注释的机器打印文档中手写文本分离

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The convenience of search, both on the personal computer hard disk as well as on the web, is still limited mainly to machine printed text documents and images because of the poor accuracy of handwriting recognizers. The focus of research in this paper is the segmentation of handwritten text and machine printed text from annotated documents sometimes referred to as the task of “ink separation” to advance the state-of-art in realizing search of hand-annotated documents. We propose a method which contains two main steps—patch level separation and pixel level separation. In the patch level separation step, the entire document is modeled as a Markov Random Field (MRF). Three different classes (machine printed text, handwritten text and overlapped text) are initially identified using G-means based classification followed by a MRF based relabeling procedure. A MRF based classification approach is then used to separate overlapped text into machine printed text and handwritten text using pixel level features forming the second step of the method. Experimental results on a set of machine-printed documents which have been annotated by multiple writers in an office/collaborative environment show that our method is robust and provides good text separation performance.
机译:由于手写识别器的准确性较差,在个人计算机硬盘上和在网络上的搜索便利仍然主要限于机器打印的文本文档和图像。本文的研究重点是从带注释的文档中分割手写文本和机器打印的文本,有时也称为“墨水分离”任务,以推进实现手工注释文档搜索的最新技术。我们提出了一种方法,该方法包含两个主要步骤-补丁电平分离和像素电平分离。在补丁程序级别分离步骤中,将整个文档建模为马尔可夫随机字段(MRF)。最初使用基于G均值的分类,然后是基于MRF的重新标记过程,首先确定了三个不同的类别(机器印刷文本,手写文本和重叠文本)。然后使用基于MRF的分类方法,使用形成该方法第二步的像素级特征,将重叠的文本分为机器打印的文本和手写的文本。在一组由办公室/协作环境中的多位作者注释的机器打印文档上的实验结果表明,我们的方法是可靠的,并提供了良好的文本分离性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号