
Exploring manual and automatic query formulation in patent IR: Initial query construction and query generation process



Abstract

Purpose - This study examines manually formulated queries and automatic query generation in an early phase of a patent "prior art" search.

Design/methodology/approach - The study was performed partly within a patent domain setting, involving three professional patent examiners, and partly in the context of the CLEF 2009 Intellectual Property (CLEF-IP) track. For the exploratory study of user-based query formulation, the three patent examiners performed the same three simulated real-life patent tasks. For the automatic query generation, a simple term-weighting algorithm based on the RATF formula was used. The manually and automatically created queries were compared to analyse what kinds of keywords were selected and from which parts of the patent documents they were taken.

Findings - For user-formulated queries, it was found that patent documents were read in a specific order of importance and that the time spent varied. Annotations were made, and collaboration took place, while reading and while selecting and ranking terms. Ranking terms was experienced as harder than selecting them. For the automatically formulated queries, it was found that the term frequencies used in the RATF formula alone do not closely approximate which terms users will judge to be relevant query terms. At the same time, the results suggest that developing a tool for generating initial queries from patent documents is feasible.

Research limitations/implications - These preliminary but informative results must be viewed in light of the fact that only three patent experts were observed and only a small set of topics was used.

Originality/value - It is usually difficult to gain access to the patent domain setting, and the results show that the methodology provided a feasible way to study the manual query formulation of the patent engineer.
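The abstract names the RATF formula but does not spell it out. As a rough illustration only, the sketch below uses the relative average term frequency in the form commonly attributed to Pirkola et al.: RATF(k) = (cf_k / df_k) * 10^3 / ln(df_k + SP)^p, where cf_k is the collection frequency of term k and df_k its document frequency. The defaults SP = 3000 and p = 3, the function names, and the toy statistics are all assumptions made for this example; the abstract does not say which variant or parameters the study used.

```python
import math


def ratf(cf, df, sp=3000, p=3):
    """Relative average term frequency of a term.

    cf: collection frequency (total occurrences in the collection)
    df: document frequency (number of documents containing the term)
    sp, p: scaling parameters; the defaults are assumed, not taken
           from the study.
    """
    if df <= 0:
        raise ValueError("df must be positive")
    return (cf / df) * 1000 / math.log(df + sp) ** p


def rank_terms(stats, k=10, sp=3000, p=3):
    """Return the k terms with the highest RATF score.

    stats maps each candidate term to a (cf, df) pair.
    """
    scores = {term: ratf(cf, df, sp, p) for term, (cf, df) in stats.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical collection statistics for terms from one patent document.
toy_stats = {
    "rotor": (120, 20),       # rare, used heavily where it occurs
    "device": (5000, 4000),   # common, about once per document
    "said": (18000, 9000),    # very common patent boilerplate
}

initial_query = rank_terms(toy_stats, k=2)
```

Under these assumptions, a term that is rare across the collection but frequent where it occurs scores higher than common vocabulary. Consistent with the findings above, such frequency-based scores alone would not be expected to match the terms a patent examiner would select.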
