首页> 外文会议>International Chemical Information Conference; 20031019-20031022; Nimes; FR >Computer-assisted categorisation of patent documents in the International Patent Classification
【24h】

Computer-assisted categorisation of patent documents in the International Patent Classification

机译:国际专利分类中专利文件的计算机辅助分类

获取原文
获取原文并翻译 | 示例

摘要

The World Intellectual Property Organisation is currently developing a system for assisting users in categorising patent documents in the International Patent Classification (IPC). The system should support the classification of documents in several languages and aims to assist users in locating relevant IPC symbols by providing them with a convenient web-based service. The approach taken for developing such a system relies on powerful machine learning algorithms that are trained on manually classified documents to recognise IPC topics. We detail in-house results of applying a custom-built state-of-the-art computer-assisted cate-goriser to English, French, Russian, and German-language patent documents. We find that reliable computer-assisted categorisation at IPC subclass level is an achievable goal for the statistical methods employed here. A categorisation system suggesting three IPC symbols for each document can predict the main IPC class correctly for around 90% of documents, and the main IPC subclass for about 85% of documents. The accuracy of the system at main group level is enhanced if the user first validates the correct IPC class.
机译:世界知识产权组织目前正在开发一种系统,以帮助用户对国际专利分类(IPC)中的专利文件进行分类。该系统应支持多种语言的文档分类,并旨在通过为用户提供方便的基于Web的服务来帮助用户查找相关的IPC符号。开发此类系统所采用的方法依赖于强大的机器学习算法,该算法在手动分类的文档上进行了训练,以识别IPC主题。我们详细介绍了对英语,法语,俄语和德语专利文件应用定制的,最先进的计算机辅助分类器的内部结果。我们发现,在IPC子类级别进行可靠的计算机辅助分类是此处采用的统计方法的可实现目标。为每个文档建议三个IPC符号的分类系统可以正确预测约90%的文档的主要IPC类,以及约85%的文档的主要IPC子类。如果用户首先验证正确的IPC类,则可以提高主组级别系统的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号