首页> 外文会议>Workshop of the cross-language evaluation forum;CLEF 2009;European conference on digital libraries >Simple Pre and Post Processing Strategies for Patent Searching in CLEF Intellectual Property Track 2009
【24h】

Simple Pre and Post Processing Strategies for Patent Searching in CLEF Intellectual Property Track 2009

机译:CLEF Intellectual Property Track 2009专利检索的简单前后处理策略

获取原文

摘要

The objective of the 2009 CLEF-IP Track was to find documents that constitute prior art for a given patent. We explored a wide range of simple preprocessing and post-processing strategies, using Mean Average Precision (MAP) for evaluation purposes. Once determined the best document representation, we tuned a classical Information Retrieval engine in order to perform the retrieval step. Finally, we explored two different post-processing strategies. In our experiments, using the complete IPC codes for filtering purposes led to greater improvements than using 4-digits IPC codes. The second postprocessing strategy was to exploit the citations of retrieved patents in order to boost scores of cited patents. Combining all selected strategies, we computed optimal runs that reached a MAP of 0.122 for the training set, and a MAP of 0.129 for the official 2009 CLEF-IP XL set.
机译:2009 CLEF-IP Track的目标是找到构成给定专利的现有技术的文档。我们使用平均平均精度(MAP)进行评估,探索了各种简单的预处理和后处理策略。一旦确定了最佳的文档表示形式,我们就对经典的信息检索引擎进行了调整,以执行检索步骤。最后,我们探索了两种不同的后处理策略。在我们的实验中,使用完整的IPC代码进行过滤比使用4位IPC代码带来了更大的改进。第二种后处理策略是利用检索到的专利的引文来提高被引用专利的分数。结合所有选定的策略,我们计算出的最佳跑步训练集的MAP达到0.122,而2009年官方CLEF-IP XL套装的MAP达到0.129。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号