Exploring manual and automatic query formulation in patent IR Initial query construction and query generation process

Preben Hansen; Anni Jarvelin; Antti Jarvelin

首页> 外文期刊>The Journal of Documentation >Exploring manual and automatic query formulation in patent IR Initial query construction and query generation process

【24h】

Exploring manual and automatic query formulation in patent IR Initial query construction and query generation process

机译：探索专利IR初始查询构造和查询生成过程中的手动和自动查询公式

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Purpose - This study aims to examine manually formulated queries and automatic query generation in an early phase of a patent "prior art" search. Design/methodology/approach - The study was performed partly within a patent domain setting, involving three professional patent examiners, and partly in the context of the CLEF 2009 Intellectual Property (CLEF-IP) track. For the exploratory study of user-based query formulation, three patent examiners performed the same three simulated real-life patent tasks. For the automatic query generation, a simple term-weighting algorithm based on the RATF formula was used. The manually and automatically created queries were compared to analyse what kinds of keywords and from which parts of the patent documents were selected. Findings - For user-formulated queries, it was found that patent documents were read in a specific order of importance and that the time varied. Annotations and collaboration were made while reading and selecting/ranking terms. Ranking terms was experienced to be harder than selecting terms. For the automatic formulated queries, it was found that the term frequencies used in the RATF alone will not quite approximate what terms will be judged as relevant query terms by the users. Simultaneously, the results suggest that developing a query generation tool for generating initial queries based on patent documents is feasible. Research limitations/implications - These preliminary but informative results need to be viewed in the light that only three patent experts were observed and that a small set of topics was used. Originality/value - It is usually difficult to get access to the setting of the patent domain and the results of the study show that the methodology provided a feasible way to study manual and the manual query formulation of the patent engineer.

机译：目的-本研究旨在检查专利“现有技术”搜索的早期阶段中手动制定的查询和自动查询生成。设计/方法/方法-这项研究部分是在专利领域中进行的，涉及三名专业专利审查员，另一部分是在CLEF 2009知识产权（CLEF-IP）跟踪范围内进行的。为了对基于用户的查询方式进行探索性研究，三名专利审查员执行了相同的三项模拟现实生活中的专利任务。为了自动生成查询，使用了一个基于RATF公式的简单术语加权算法。比较了手动和自动创建的查询，以分析哪些类型的关键字以及从哪些部分选择了专利文件。调查结果-对于用户提出的查询，发现专利文件的阅读顺序是特定的，并且时间有所不同。阅读和选择/排序术语时进行了注释和协作。排名术语比选择术语更难。对于自动制定的查询，发现仅在RATF中使用的术语频率不会完全近似于用户将哪些术语视为相关查询术语。同时，结果表明，开发用于基于专利文档生成初始查询的查询生成工具是可行的。研究的局限性/意义-鉴于只有三名专利专家并且只使用了一小部分主题，因此需要对这些初步而有益的结果进行研究。原创性/价值-通常很难访问专利领域的设置，研究结果表明，该方法为研究手册和专利工程师的手动查询公式提供了一种可行的方法。

著录项

来源
《The Journal of Documentation》 |2013年第6期|873-898|共26页
作者
Preben Hansen; Anni Jarvelin; Antti Jarvelin;
展开▼
作者单位

Department of Computer and Systems Sciences, Stockholm University,Kista, Sweden;

School of Information, University of Tampere, Tampere, Finland;

School of Information, University of Tampere, Tampere, Finland;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Information searches; Knowledge processes; Patent information retrieval; Relevance assessment; Manual query formulation; Automatic query generation;

机译：信息搜索;知识过程;专利信息检索;相关性评估;手动查询公式;自动查询生成;

相似文献

外文文献
中文文献
专利

1. Learning keyword phrases from query logs of USPTO patent examiners for automatic query scope limitation in patent searching [J] . Wolfgang Tannebaum, Andreas Rauber World Patent Information . 2015,第JUNa期

机译：从USPTO专利审查员的查询日志中学习关键字词组，以在专利搜索中自动限制查询范围
2. Using query logs of USPTO patent examiners for automatic query expansion in patent searching [J] . Wolfgang Tannebaum, Andreas Rauber Information retrieval . 2014,第5a6期

机译：使用USPTO专利审查员的查询日志在专利检索中自动查询扩展
3. Exploring features for automatic identification of news queries through query logs [J] . Xiaojuan ZHANG, Jian LI 中国文献情报（英文刊） . 2014,第004期

机译：探索通过查询日志自动识别新闻查询的功能
4. Exploring Structured Documents and Query Formulation Techniques for Patent Retrieval [C] . Walid Magdy, Johannes Leveling, Gareth J.F. Jones Workshop of the cross-language evaluation forum;CLEF 2009;European conference on digital libraries . 2010

机译：探索专利检索的结构化文档和查询制定技术
5. RECURSIONS AND RULE SELECTIONS ON A HIGH LEVEL RELATION PROCESSOR FOR KNOWLEDGE-BASE MACHINE (ARTIFICIAL INTELLIGENCE, FIRST-ORDER QUERY, PROLOG, 'CUT' OPERATOR, FIFTH GENERATION COMPUTER) [D] . SHIN, DONGPIL 1986

机译：基于知识库的机器（人工智能，一阶查询，PROLOG，“剪切”运算符，第五代计算机）的高层次关系处理器上的递归和规则选择
6. Automatic query generation using word embeddings for retrieving passages describing experimental methods [O] . Ferhat Aydın, Zehra Melce Hüsünbeyi, Arzucan Özgür 2017

机译：使用单词嵌入自动查询生成以检索描述实验方法的段落
7. Exploring structured documents and query formulation techniques for patent retrieval [O] . Walid Magdy, Johannes Leveling, Gareth J. F. Jones 2010

机译：探索结构化文档和专利检索的查询制定技术

Exploring manual and automatic query formulation in patent IR Initial query construction and query generation process

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅