IPC Multi-label Classification Applying the Characteristics of Patent Documents

机译：IPC多标签分类应用专利文献的特征

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most of research on the IPC automatic classification system has focused on applying various existing machine learning methods to the patent documents rather than considering the characteristics of the data or the structure of the patent documents. This paper, therefore, proposes using two structural fields, a technical field and a background field which are selected by applying the characteristics of patent documents and the role of the structural fields. A multi-label classification model is also constructed to reflect that a patent document could have multiple IPCs and to classify patent documents at an IPC subclass level comprised of 630 categories. The effects of the structural fields of the patent documents are examined using 564,793 registered patents in Korea. An 87.2 % precision rate is obtained when using the two fields mainly. From this sequence, it is verified that the technical field and background field play an important role in improving the precision of IPC multi-label classification at the IPC subclass level.

机译：关于IPC自动分类系统的大多数研究都集中在将各种现有机器学习方法应用于专利文献，而不是考虑数据的特征或专利文献的结构。因此，本文提出了通过应用专利文献的特征和结构领域的作用来选择的两个结构领域，技术领域和背景领域。还构造了多标签分类模型，以反映专利文献可以具有多个IPC，并在由630类组成的IPC子类级别分类专利文档。使用韩国的564,793注册专利检查专利文献结构领域的效果。使用两个字段主要获得87.2％的精确率。从这个序列中，验证了技术领域和背景领域在提高IPC子类级别的IPC多标签分类的精度方面发挥着重要作用。

著录项

来源
《International conference on computer science and it applications》|2017年|xxxvi 1113 p.|共7页
会议地点
作者
Sora Lim; YongJin Kwon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Patent classification; IPC classification; Patent document fields; Data characteristics; Multi-label classification;

机译：专利分类;IPC分类;专利文献字段;数据特征;多标签分类;
入库时间 2022-08-20 23:18:20

相似文献

外文文献
中文文献
专利

1. Mapping Iranian patents based on International Patent Classification (IPC), from 1976 to 2011 [J] . Alireza Noruzi, Mohammadhiwa Abdekhoda Scientometrics . 2012,第3期

机译：1976年至2011年根据国际专利分类（IPC）映射伊朗专利
2. Analysis of a database of public domain Brazilian patent documents based on the IPC [J] . Wanise B.G. Barroso, Luc Quoniam, Jose Angelo R. Gregolin, World Patent Information . 2003,第1期

机译：基于IPC的巴西公共领域专利文件数据库的分析
3. Study on Multi-Label Classification of Medical Dispute Documents [J] . Baili Zhang, Shan Zhou, Le Yang, Computers, Materials & Continua . 2020,第3期

机译：医学纠纷文件多标签分类研究
4. IPC Multi-label Classification Applying the Characteristics of Patent Documents [C] . Sora Lim, YongJin Kwon International conference on computer science and it applications;International conference on ubiquitous information technologies . 2017

机译：运用专利文件特征的IPC多标签分类
5. Automated patent classification for German patent documents. [D] . Zakaria, Saiedeh. 1989

机译：德国专利文件的自动专利分类。
6. A Spacecraft Electrical Characteristics Multi-Label Classification Method Based on Off-Line FCM Clustering and On-Line WPSVM [O] . Ke Li, Yi Liu, Quanxin Wang, -1

机译：基于离线FCM聚类和在线WPSVM的航天器电气特性多标签分类方法
7. The effect of knowledge convergence characteristics on firm’s innovation performance via International Patent Classification(IPC) co-occurrence network analysis - Focused on Electricity and Electronic SMEs - [O] . Minjung Lee, Changhyeon Song, Yeonbae Kim 2018

机译：知识融合特征对公司创新绩效的影响通过国际专利分类（IPC）共同发生网络分析 - 专注于电力和电子中小企业 -

IPC Multi-label Classification Applying the Characteristics of Patent Documents

摘要

著录项

相似文献

相关主题

期刊订阅