Cluster-based patent retrieval

In-Su Kang; Seung-Hoon Na; Jungi Kim; Jong-Hyeok Lee

Abstract

Through the recent NTCIR workshops, patent retrieval has posed many challenging issues to the information retrieval community. Unlike newspaper articles, patent documents are very long and well structured. These characteristics make it necessary to reassess existing retrieval techniques, which have been developed mainly for short, unstructured documents such as newspaper articles. This study investigates cluster-based retrieval in the context of the invalidity search task of patent retrieval. Cluster-based retrieval assumes that clusters provide additional evidence for matching a user's information need. Thus far, cluster-based retrieval approaches have relied on automatically created clusters. Fortunately, all patents carry manually assigned cluster information in the form of International Patent Classification (IPC) codes. The International Patent Classification is a standard taxonomy for classifying patents and currently has about 69,000 nodes organized into a five-level hierarchy. Patent documents could therefore provide the best test bed for developing and evaluating cluster-based retrieval techniques. Experiments on the NTCIR-4 patent collection showed that the cluster-based language model could help improve on the cluster-less baseline language model.
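
The abstract does not state the scoring formula, so the following is only a minimal sketch of one common form of cluster-based language model: a document model interpolated with the model of its cluster and then with the collection model. It assumes that clusters are formed from International Patent Classification (IPC) codes truncated to the subclass level. The function names, the toy patents, and the weights `lam` and `beta` are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def ipc_levels(code):
    """Split an IPC code such as 'H04L 12/28' into its five hierarchy levels."""
    subclass, group = code.split()
    main_group = group.split("/")[0]
    return [
        subclass[0],                    # section, e.g. 'H'
        subclass[:3],                   # class, e.g. 'H04'
        subclass,                       # subclass, e.g. 'H04L'
        f"{subclass} {main_group}/00",  # main group, e.g. 'H04L 12/00'
        code,                           # subgroup, e.g. 'H04L 12/28'
    ]


def unigram_model(tokens):
    """Maximum-likelihood unigram model over a list of tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def score(query, doc, cluster, collection, lam=0.5, beta=0.5):
    """Query log-likelihood under a document model smoothed with its
    cluster model and the collection model (assumed interpolation form)."""
    p_doc = unigram_model(doc)
    p_cluster = unigram_model(cluster)
    p_coll = unigram_model(collection)
    total = 0.0
    for word in query:
        background = beta * p_cluster.get(word, 0.0) + (1 - beta) * p_coll.get(word, 0.0)
        prob = lam * p_doc.get(word, 0.0) + (1 - lam) * background
        if prob == 0.0:
            return float("-inf")  # query word unseen in the whole collection
        total += math.log(prob)
    return total


# Toy patents (hypothetical): one IPC code and a token list per document.
patents = {
    "d1": ("H04L 12/28", "network packet switching device".split()),
    "d2": ("H04L 29/06", "protocol for packet transmission".split()),
    "d3": ("A01B 1/00", "hand tool for soil".split()),
}

# Assumption: a cluster is the text of all patents sharing an IPC subclass.
clusters = defaultdict(list)
collection = []
for code, tokens in patents.values():
    clusters[ipc_levels(code)[2]].extend(tokens)
    collection.extend(tokens)

query = ["packet", "protocol"]
scores = {
    doc_id: score(query, tokens, clusters[ipc_levels(code)[2]], collection)
    for doc_id, (code, tokens) in patents.items()
}
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this toy setup, `d1` never mentions "protocol" but shares subclass `H04L` with `d2`, so the cluster model lends it probability mass for that term and it ranks above the unrelated `d3`. Truncating the IPC code at a different level of the hierarchy would give coarser or finer clusters.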


Bibliographic record

Source
Information Processing & Management, 2007, No. 5, pp. 1173-1182 (10 pages)
Authors
In-Su Kang; Seung-Hoon Na; Jungi Kim; Jong-Hyeok Lee
Affiliation

PIRL 323, Pohang University of Science and Technology, San 31, Hyoja-dong, Nam-gu, Pohang 790-784, Republic of Korea

Indexing information
Original format: PDF
Language: English
CLC classification: Library science and librarianship; information science and information work
Keywords
cluster-based retrieval; patent retrieval; invalidity search; international patent classification
Date added: 2022-08-17 23:20:28

