Multi-classification of Patent Applications with Winnow

机译：Winnow对专利申请的多分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Winnow family of learning algorithms can cope well with large numbers of features and is tolerant to variations in document length, which makes it suitable for classifying large collections of large documents, like patent applications. Both the large size of the documents and the large number of available training documents for each class make this classification task qualitatively different from the classification of short documents (newspaper articles or medical abstracts) with few training examples, as exemplified by the TREC evaluations. This note describes recent experiments with Winnow on two large corpora of patent applications, supplied by the European Patent Office (EPO). It is found that the multi-classification of patent applications is much less accurate than the mono-classification of similar documents. We describe a potential pitfall in multi-classification and show ways to improve the accuracy. We argue that the inherently larger noisiness of multi-class labeling is the reason that multi-classification is harder than mono-classification.

机译：Winnow系列学习算法可以很好地应对大量功能，并且可以容忍文档长度的变化，这使其适用于对大型文档的大集合进行分类，例如专利申请。每个班级的大量文档和大量可用的培训文档都使该分类任务与简短文档（报纸文章或医学摘要）的分类在质量上有所不同，而培训文档很少，例如TREC评估所示。本说明描述了由Winnow在欧洲专利局（EPO）提供的两个大型专利申请上进行的最新实验。发现专利申请的多分类比相似文献的单分类准确度低得多。我们描述了多重分类中的潜在陷阱，并显示了提高准确性的方法。我们认为，多类别标签固有的较大噪音是多类别比单类别更难的原因。

著录项

来源
《Perspectives of System Informatics》|2004年|P.546-555|共10页
会议地点
作者
Cornelis H. A. Koster; Marc Seutter; Jean Beney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. A regularized pairwise multi-classification knowledge-based machine and applications [J] . Oladunni OO, Trafalis TB European Journal of Operational Research . 2009,第3期

机译：基于正则化的双向多分类知识的机器和应用
2. Changes to Practice for Continued Examination Filings, Patent Applications Containing Patentably Indistinct Claims, and Examination of Claims in Patent Applications; Final Rule [J] . Biotechnology Law Report . 2007,第6期

机译：对连续审查申请，包含明显含糊的权利要求的专利申请以及专利申请中的权利要求进行审查的做法的变更;最终规则
3. Report on the Status of National Patent Strength in 2017 Released 2018 Training Course of Patent System and Patent Examination Practicing Between the Countries along the Belt and Road Launched SIPO issued 2017 Research Report of Chinese Patent China Files Second Highest Number of PCT Applications in 2017 [J] . World Patent Information . 2018,第SEPa期

机译：2017年国家专利状况报告发布2018年``一带一路''沿线国家专利制度培训和专利实践培训课程上报国家知识产权局发布《 2017年中国专利研究报告》，2017年PCT申请量位居第二
4. Multi-classification of Patent Applications with Winnow [C] . Cornelis H. A. Koster, Marc Seutter, Jean Beney International Andrei Ershov Memorial Conference . 2003

机译：WinNow的专利申请多分类
5. Dynamic language bridges and applications in LED patent citation analysis. [D] . Pringle, Benjamin. 2014

机译：动态语言桥梁及其在LED专利引用分析中的应用。
6. Patenting of University and Non-University Public Research Organisations in Germany: Evidence from Patent Applications for Medical Research Results [O] . Peter Tinnemann, Jonas Özbay, Victoria A. Saint, 2010

机译：德国大学和非大学公共研究组织的专利：医学研究结果专利申请的证据
7. Multi-classification of Patent Applications with Winnow [O] . Cornelis H. A. Koster, Marc Seutter, Jean Beney 2003

机译：Winnow对专利申请的多分类

Multi-classification of Patent Applications with Winnow

摘要

著录项

相似文献

相关主题

期刊订阅