12th International Conference on Frontiers in Handwriting Recognition

Retrieving Handwriting Styles: A Content Based Approach to Handwritten Document Retrieval

Abstract

Large-scale retrieval of handwritten documents has primarily focused on searching for a query text in the OCR'ed transcriptions of the document images, which provides a limited view of the complete search process. Recent research advances have led to a number of content-based retrieval techniques that expand the search scope to the document content level (i.e. image features, meta-information). Based on similar motivations, we propose a new approach to content-based retrieval of handwritten document images: retrieving the handwriting styles most similar to that of a handwritten query image. At its core, we formulate this problem as a task of unsupervised writer style classification that requires no style definitions or grammar. We build upon our previous work in writer style modeling and apply it to learn a style distribution for every handwriting sample in the corpus. Given a query image, all documents are ranked in order of their style distribution similarity. Experiments conducted on the publicly available IAM dataset demonstrate the efficacy of our proposed method over baseline feature-based systems.
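
The ranking step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which similarity measure is used, so the Jensen-Shannon divergence below is an assumption chosen only for illustration. The function names (`kl_divergence`, `js_divergence`, `rank_by_style`) and the toy style distributions are likewise hypothetical and are not taken from the paper.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) for discrete distributions."""
    # Terms with p_i == 0 contribute nothing by convention.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, and 0 when p == q."""
    # Assumed similarity measure; the abstract does not name one.
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def rank_by_style(query_dist, corpus_dists):
    """Return (doc_id, divergence) pairs, most similar style first."""
    scored = [
        (doc_id, js_divergence(query_dist, dist))
        for doc_id, dist in corpus_dists.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])


# Hypothetical style distributions over three latent styles.
corpus = {
    "doc_a": [0.7, 0.2, 0.1],
    "doc_b": [0.1, 0.1, 0.8],
    "doc_c": [0.6, 0.3, 0.1],
}
query = [0.7, 0.2, 0.1]

ranking = rank_by_style(query, corpus)
for doc_id, score in ranking:
    print(doc_id, round(score, 4))
```

In this sketch each document is reduced to a probability vector over latent styles, and a lower divergence means a closer style match. Any other distance between distributions could be substituted without changing the ranking logic.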