...
首页> 外文期刊>Journal of Imaging >Efficient Query Specific DTW Distance for Document Retrieval with Unlimited Vocabulary
【24h】

Efficient Query Specific DTW Distance for Document Retrieval with Unlimited Vocabulary

机译:无限词汇量的文档检索的有效查询特定DTW距离

获取原文

摘要

In this paper, we improve the performance of the recently proposed Direct Query Classifier ( dqc ). The ( dqc ) is a classifier based retrieval method and in general, such methods have been shown to be superior to the OCR-based solutions for performing retrieval in many practical document image datasets. In ( dqc ), the classifiers are trained for a set of frequent queries and seamlessly extended for the rare and arbitrary queries. This extends the classifier based retrieval paradigm to an unlimited number of classes (words) present in a language. The ( dqc ) requires indexing cut-portions (n-grams) of the word image and dtw distance has been used for indexing. However, dtw is computationally slow and therefore limits the performance of the ( dqc ). We introduce query specific dtw distance, which enables effective computation of global principal alignments for novel queries. Since the proposed query specific dtw distance is a linear approximation of the dtw distance, it enhances the performance of the ( dqc ). Unlike previous approaches, the proposed query specific dtw distance uses both the class mean vectors and the query information for computing the global principal alignments for the query. Since the proposed method computes the global principal alignments using n-grams, it works well for both frequent and rare queries. We also use query expansion ( qe ) to further improve the performance of our query specific dtw . This also allows us to seamlessly adapt our solution to new fonts, styles and collections. We have demonstrated the utility of the proposed technique over 3 different datasets. The proposed query specific dtw performs well compared to the previous dtw approximations.
机译:在本文中,我们提高了最近提出的直接查询分类器(dqc)的性能。 (dqc)是基于分类器的检索方法,通常,已证明此类方法优于基于OCR的解决方案,可在许多实际文档图像数据集中执行检索。在(dqc)中,对分类器进行了一系列频繁查询的训练,并针对稀有查询和任意查询进行了无缝扩展。这将基于分类器的检索范式扩展到一种语言中存在的无限多个类(单词)。 (dqc)需要索引词图像的切面部分(n-gram),并且dtw距离已用于索​​引。但是,dtw计算速度较慢,因此限制了(dqc)的性能。我们介绍了特定于查询的dtw距离,从而可以有效地计算新颖查询的全局主体对齐方式。由于建议的查询特定dtw距离是dtw距离的线性近似,因此可以提高(dqc)的性能。与以前的方法不同,建议的特定于查询的dtw距离使用类均值向量和查询信息来计算查询的全局主要比对。由于所提出的方法使用n元语法来计算全局主体对齐方式,因此对于频繁查询和罕见查询都适用。我们还使用查询扩展(qe)来进一步提高特定查询dtw的性能。这也使我们可以无缝地将解决方案适应新的字体,样式和集合。我们已经在3个不同的数据集上证明了该技术的实用性。与先前的dtw近似相比,建议的查询特定dtw表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号