首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >Retrieving Handwriting Styles: A Content Based Approach to Handwritten Document Retrieval
【24h】

Retrieving Handwriting Styles: A Content Based Approach to Handwritten Document Retrieval

机译:检索手写样式:基于内容的手写文档检索方法

获取原文

摘要

Large scale retrieval of handwritten documents has primarily been focused around searching a query text in the OCR’ed transcription of the document images, which provides a limited view of the complete search process. Recent research advances have led to a number of content based retrieval techniques which expand the search scope to document content level (i.e. image features, meta-information). Based on similar motivations, we propose a new approach to content based retrieval of handwritten document images by retrieving similar handwriting styles corresponding to a handwritten query image. At the core, we formulate this problem as the task of unsupervised writer style classification without the need of any style definitions or grammar. We build upon our previous work in writer style modeling and apply it to learn a style distribution for every handwriting sample in the corpus. Given a query image, all documents are ranked in order of their style distribution similarity. Experimental results conducted on publicly available IAM dataset demonstrate the efficacy of our proposed method over baseline feature based systems.
机译:手写文档的大规模检索主要集中在文档图像的OCR'ED转录中搜索查询文本,该文件提供了完整搜索过程的有限视图。最近的研究进步导致了许多基于内容的检索技术,它将搜索范围扩展到文档内容级别(即图像特征,元信息)。基于类似的动机,通过检索与手写查询图像相对应的类似的笔迹样式提出了一种基于手写文档图像的内容的新方法。在核心,我们将这个问题作为无监督者风格分类的任务,而无需任何风格定义或语法。我们建立在我们以前的作家风格建模工作中,并应用它来学习语料库中每种手写样本的风格分布。鉴于查询图像,所有文档都按照其样式分布相似度排序。在公开的IAM数据集上进行的实验结果证明了我们提出的方法对基于基线特征的系统的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号