...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Separability versus prototypicality in handwritten word-image retrieval
【24h】

Separability versus prototypicality in handwritten word-image retrieval

机译:手写单词图像检索中的可分离性与原型性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Hit lists are at the core of retrieval systems. The top ranks are important, especially if user feedback is used to train the system. Analysis of hit lists revealed counter-intuitive instances in the top ranks for good classifiers. In this study, we propose that two functions need to be optimised: (a) in order to reduce a massive set of instances to a likely subset among ten thousand or more classes, separability is required. However, the results need to be intuitive after ranking, reflecting (b) the prototypicality of instances. By optimising these requirements sequentially, the number of distracting images is strongly reduced, followed by nearest-centroid based instance ranking that retains an intuitive (low-edit distance) ranking. We show that in handwritten word-image retrieval, precision improvements of up to 35 percentage points can be achieved, yielding up to 100% top hit precision and 99% top-7 precision in data sets with 84 000 instances, while maintaining high recall performances. The method is conveniently implemented in a massive scale, continuously trainable retrieval engine, Monk.
机译:命中列表是检索系统的核心。最高排名很重要,尤其是如果使用用户反馈来训练系统时。对命中列表的分析揭示了在良好分类器中排名最高的实例。在这项研究中,我们建议需要优化两个功能:(a)为了将大量实例减少到上万个或更多类的可能子集,需要可分离性。但是,排序后的结果必须是直观的,以反映(b)实例的原型。通过依次优化这些要求,可以大大减少分散注意力的图像的数量,然后进行基于最近质心的实例排名,从而保留直观的(低编辑距离)排名。我们表明,在手写文字图像检索中,可以实现高达35个百分点的精度改进,在具有84 000个实例的数据集中,最高命中精度和99%最高7位精度得到了提高,同时保持了较高的召回性能。该方法可在大规模,可连续训练的检索引擎Monk中方便地实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号