首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Improving Information Retrieval in Multiwriter Scenario by Exploiting the Similarity Graph of Document Terms
【24h】

Improving Information Retrieval in Multiwriter Scenario by Exploiting the Similarity Graph of Document Terms

机译:利用文档条款的相似性图,提高多错台方案中的信息检索

获取原文

摘要

Information Retrieval (IR) is the activity of obtaining information resources relevant to a questioned information. It usually retrieves a set of objects ranked according to the relevancy to the needed fact. In document analysis, information retrieval receives a lot of attention in terms of symbol and word spotting. However, through decades the community mostly focused either on printed or on single writer scenario, where the state-of-the-art results have achieved reasonable performance on the available datasets. Nevertheless, the existing algorithms do not perform accordingly on multiwriter scenario. A graph representing relations between a set of objects is a structure where each node delineates an individual element and the similarity between them is represented as a weight on the connecting edge. In this paper, we explore different analytics of graphs constructed from words or graphical symbols, such as diffusion, shortest path, etc. to improve the performance of information retrieval methods in multiwriter scenario.
机译:信息检索(IR)是获取与质疑信息相关的信息资源的活动。它通常检索根据所需事实的相关性排名的一组对象。在文档分析中,信息检索在符号和单词斑点方面接收很多关注。然而,通过数十年来,社区主要集中在印刷或单一作家场景中,最先进的结果在可用数据集中实现了合理的性能。然而,现有算法在多错台方案上不执行相应的算法。表示一组对象之间的关系的图形是每个节点描绘单个元素的结构,并且它们之间的相似性被表示为连接边缘的权重。在本文中,我们探讨了由单词或图形符号构建的图形的不同分析,例如扩散,最短路径等,以提高多错机方案中信息检索方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号